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ABSTRACT 


The  purpose  of  this  research  was  to  develop  a  causal  quantitative 
relationship  between  transportation  development  and  city  development. 

A  generalized  eleven  variable  transportation  city  development  path 
model  was  hypothesized,  and  calibrated  separately  with  1940,  1950,  1960 
and  19  70  collected  data.   Cities  sampled  were  representative  of  all 
non-metropolitan  Indiana  cities  with  population  between  2500  and  50,000. 

The  individual  calibrated  models  sufficiently  explained  the 
variations  of  land  uses  within  the  city,  but  the  four  calibrated  models 
were  different  from  each  other  and  could  not  be  combined  into  one 
model. 

The  diagram  of  calibrated  path  models  and  path  coefficients  were 
used  to  calculate  total  effect  coefficients  for  effect  of  each  signifi- 
cant dependent  variable  on  each  independent  variable  while  controlling 
for  extraneous  causes. 


CHAPTER  ONE 
INTRODUCTION 

A  new  or  improved  transportation  facility  influences  land  develop- 
ment.  The  resultant  land  development  generates  additional  trips,  which 
in  turn  affect  the  operational  characteristics  of  the  transportation 
facility.   The  transportation  facility  and  the  resultant  land  develop- 
ment in  turn  affect  the  economic  development  and  environmental  condition 
of  the  region  and  the  community  (1)*.   Estimation  of  probable  land 
development  associated  with  a  transportation  facility  is  essential  to 
proper  evaluation  of  a  given  transportation  alternative.   Unfortunately, 
it  is  difficult  to  predict  probable  land  development  associated  with 
a  planned  transportation  facility.   As  a  result,  proper  consideration 
has  not  been  given  to  this  criterion  in  the  transportation  system 
evaluation  process.   Generally,  the  resultant  land  development  has  been 
obtained  from  "intuitively  derived  generalities"  (2) . 

The  exact  relationship  of  transportation  and  land  use  is  not 
clear.   For  example,  Kingham  documents  qualitative  evidence  that  roads 
do  control  land  use,  as  well  as  evidence  that  roads  do  not  control  land 
use  (3).   Putman,  et  al,  confirm  existance  of  qualitative  evidence  of 
an  interrelationship  between  highway  development  and  land  development, 
but  finds  no  study  to  show  a  quantitative  causal  relationship  between 


''^Numbers  refer  to  items  in  the  Bibliography. 


land  use  and  transportation.   They  state  "...,  there  does  not  seem  to 
be  any  prospect  of  obtaining  such  evidence  since,  in  the  strictest 
sense  this  would  require,  the  performance  of  a  controlled  experiment 
on  the  real  world"  (4). 

It  was  determined,  from  studies  of  the  available  literature,  that 
tlie  attempt  to  obtain  a  quantitative  causal  relationship  l)c;tween 
transportation  and  land  use  required  an  approach  different  from 
previously  performed  transportation-land  use  studies.   Previous  studies 
generally  tried  to  measure  the  effect  of  change  on  the  land  use  which 
came  about  as  a  result  of  change  in  the  transportation  system.   As  will 
be  more  fully  explained  in  Chapter  2,  this  method,  normally  called  a 
"before-and-af ter  study",  belongs  to  the  family  of  experiments  that 
requires  control.   Basic  to  a  controlled  experiment  is  the  ability  to 
replicate  the  experimental  units.   It  is  not  possible  to  replicate  an 
urban  environment  and  the  subsequent  transportation  change.   Therefore, 
studies  using  variations  of  "before  and  after"  studies  have  not  been 
successful  in  explaining  the  interrelationship  of  transportation  and 
land  use. 

Fortunately,  replication  of  experimental  units  (urban  areas)  is 
not  necessary  for  learning  about  the  generalized  relationship  between 
transportation  and  land  use.   "Quasi-Experimental  Design"  allows 
learning  from  observation  of  experimental  units  without  having  replica- 
tion of  experimental  units.   "Design  of  Experiment"  is  that  part  of 
statistical  science  which  allows,  for  example,  agricultural  scientists 
to  compare  the  quality  of  two  seeds  by  keeping  all  other  conditions 
constant  (by  replicating  experimental  units).   "Quasi-design  of 


Experiment"  uses  the  technique  of  "Design  of  Experiment"  for  occasions 
when  replications  of  experimental  data  are  not  possible. 

Purpose 
The  purpose  of  this  research  is  to  develop  a  generalized  quantita- 
tive causal  relationship  between  transportation  development  and  urban 
development.   This  quantitative  relationship  is  useful  in  the  evalua- 
tion phase  of  the  transportation  process,  as  well  as  in  transportation 
impact  studies. 

Scope 

The  inference  space  of  this  study  was  limited  to  cities  in 
Indiana  with  a  population  greater  than  2500  and  less  than  50,000, 
which  were  not  located  within   standard  metropolitan  statistical  areas 
(SMSA) .   The  data  used  in  this  study  was  collected  from  four  study 
periods  of  1940,  1950,  1960  and  1970. 

The  lower  limit  of  2500  was  chosen  because  certain  census  data 
were  not  available  for  smaller  cities.   An  upper  limit  of  50,000  was 
chosen  because  in  this  study,  land  uses  were  measured  from  aerial 
photographs  in  terms  of  acreage  of  land  use  for  a  given  purpose.   It 
was  felt  that  in  larger  towns,  due  to  density  of  structural  pattern, 
square  footage  of  different  land  uses  had  to  be  measured  instead  of 
land  uses  in  acres.   Measurement  of  floor  space  was  costlier  than 
measurement  in  acres,  and  historical  measurements  of  floor  space  was 
not  readily  available  (although  possible  from  aerial  photographs) . 

The  inference  space  was  also  limited  to  cities  in  non-SMSA 
counties.   This  limitation  was  placed  in  order  to  avoid  dealing  with 


the  problem  of  the  experimental  city  being  adjacent  to  a  larger  city, 
or  in  an  integrated  and  built-up  area. 

The  model(s)  developed  are  aggregate  models  in  the  sense  that  the 
city  itself  is  the  experimental  unit.   The  models  developed  measure 
the  change  in  a  particular  land  use  for  the  city,  as  a  whole,  as  a 
result  of  change  in  the  transportation  system.   The  developed  model  is 
not  sensitive  for  land  use  changes  in  individual  traffic  zones. 
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CHAPTER  TWO 
PREVIOUS  INVESTIGATION 

Transportation  Impact  studies  number  in  the  thousands  (1) .   A  few 
of  these  reports  were  studied  in  the  early  part  of  the  research  (2  to  12) 
These  reports  provided  an  understanding  of  impact  studies  which  would 
have  been  hard  to  obtain  from  studies  of  abstracts  and  literature 
reviews  alone.   The  shear  number  of  transportation  impact  studies,  and 
the  fact  that  many  of  the  studies  are  repetitive,  necessitated  extensive 
use  of  available  state-of-the-art  reviews  of  transportation  impact 
studies  (1,  13  to  20). 

Types  of  Transportation  Impact  Studies 
Transportation  impact  studies  were  categorized  as  to  type  of 
transportation  facilities  studied  to  determine  methodologies  used  and 
results  obtained.   Putman,  et  al,  divide  transportation  impact  studies 
into  the  following  five  general  categories  (21): 

1.  Impact  of  highway  bypasses. 

2.  Impact  of  new  highway  construction  on  a  narrow  corridor. 

3.  Land  development  around  freeway  interchanges. 

4.  Impact  of  major  highway  construction  upon  a  region. 

5.  Relationship  of  transportation  development  and  land  use 
development. 


Hy-Pass  Studies 

Land  use  changes  were  reported  for  many  by-passes.   There  were 
some  excellent  early  bypass  studies.   Unfortunately,  bypass  studies 
become  repetitious,  and  generalized  knowledge  useful  in  prediction  of 
land  use  consequences  of  a  particular  bypass  was  not  available. 

In  19A9-50  a  7.1  mile  long,  free  access,  two-lane  bypass  was 
built  near  Kokomo,  Indiana.   Kokomo  (1950  population  38,600)  is  a 
"typical  raidwestern  city  that  has  grown  and  prospered  in  the  last  one 
hundred  years".   A  1951  study  of  the  Kokomo  bypass  shows  the  land  use 
maps  for  1938  and  1948  obtained  from  aerial  photographs,  and  a  land 
use  map  for  1951  obtained  from  a  field  study.   These  land  use  maps  show 
that  prior  to  1948  the  growth  of  the  city  was  toward  the  south  and 
southwest.   In  1951,  the  growth  continued  toward  the  south  and  southwest, 
but  there  was  also  growth  along  the  bypass  (23) .   A  re-study  of  the 
same  bypass  in  1965  showed  the  shift  in  direction  of  city  growth  from 
the  south  and  southwest  to  the  east  where  the  bypass  was  located  (24) . 

Vargha  (25)  in  a  study  of  218  small  communities  discussed  their 
growth  in  relation  to  highway  bypasses  and  natural  barriers  and  con- 
cluded bypasses  with  limited  access  act  as  natural  barriers.   TiJhen  one 
side  of  the  community  was  contained  by  a  natural  barrier  and  a  limited 
access  bypass  was  contemplated  for  the  other  side,  he  recommended 
enough  land  be  left  between  existing  community  and  the  bypass  for  the 
community  to  grow  to  a  population  size  of  1200  to  2000.   If  this  margin 
of  land  for  potential  growth  was  not  included,  the  community  remains  so 
small  that  it  acts  as  one  unit  and  cannot  cross  the  barrier  of  an 
access-1 1ml  tt'd  byi^ass. 


Skorpa,  et  al,  summarize  the  results  of  land  use  changes  from 
before-and-after  studies  of  four  highway  bypasses  in  the  state  of 
Texas.   They  conclude  that  the  main  effect  of  those  bypasses  was  to 
change  agricultural  land  into  land  held  for  future  use.   Also  they 
state  that  the  effect  of  land  use  changes  was  basically  limited  to  a 
"very  narrow  strip  along  the  facility"  (26) . 

Impact  of  Radial  Highways  on  a  Narrow  Corridor 
Generally,  the  impact  of  radial  highways  was  measured  in  terms  of 
increase  in  land  value  for  property  close  to  the  roadway.   Also,  in 
urban  areas,  the  land  use  generally  was  changed  to  commercial  land  use 
(27).   References  12  to  15  describe  several  of  these  studies.   One 
typical  example  of  this  type  of  impact  study  was  "a  before-and-after" 
study  of  effects  on  land  values  and  land  use  of  5.4  miles  of  the  Dallas 
central  expressway. 

The  Dallas  study,  in  part,  collected  land  use  and  land  values  for 
parcels  close  to  the  expressway,  from  three  different  "bands".   The 
"A  band"  included  properties  abutting  the  expressway's  frontage  road. 
The  "B"  and  "C"  bands  which  were  two  blocks  wide  parallel  the  "A"  band. 
There  was  also  a  control  area.   Land  values  in  the  three  bands  were 
compared  with  land  values  in  the  control  areas.   The  conclusion  reached 
was  that  land  values  adjacent  to  the  expressways  increased  more  than 
land  values  in  the  control  areas  (28) . 

Interchange  Studies 
Skorpa,  et  al,  summarized  the  results  of  studies  of  land  develop- 
ment adjoining  66  Interchanges  in  Michigan,  36  interchanges  in 


Pennsylvania,  221  interchanges  in  North  Carolina  and  102  interchanges 
in  Indiana.   Their  conclusion  was  that  "despite  many  similarities  in 
land  development  of  different  interchanges  locations,  too  little  is 
known  about  the  underlying  factors  to  be  able  to  predict  future  land 
use  at  a  specific  interchange"  (29) . 

Impact  of  Major  Highway  Construction  on  a  Region 
Another  type  of  highway  impact  study  was  the  attempt  to  find  the 
effects  of  major  highway  improvement  on  a  region.   Objectives  of  these 
studies  were  similar  to  that  of  the  current  project.   Unfortunately, 
choice  of  improper  methodology  made  it  difficult  to  obtain  generalized 
knowledge  a|)p  Licablc  to  the  planning  of  future  transportation  projects. 
An  early  example  of  this  type  of  study  was  the  economic  impact  study  of 
Massachusetts  Route  128  (30).   Two  additional  examples  were  studies  of 
impact  of  freeways  in  the  states  of  North  Carolina  (31)  and  Connecticut 
(32) .   The  North  Carolina  study  was  one  of  the  more  ambitious  studies 
of  this  group  and  attempted  to  develop  predictive  land  development 
equations . 

The  North  Carolina  study  attempted  to  measure  the  impact  of  rural, 
suburban  and  urban  freeways  in  the  state  of  North  Carolina  separately. 
Average  traffic  volumes  and  land  use  for  310  miles  of  rural  highways 
with  105  interchanges  as  well  as  171  miles  of  suburban  freeways  with 
76  interchanges  was  inventoried.   The  inventories  and  subsequent 
analysis  lead  to  "several  general  observations  relative  to  the  highway 
facilities  in  the  rural  and  suburban  areas"  (33). 


The  most  comprehensive  analysis  in  the  North  Carolina  study  was 
that  of  land  development  characteristics  of  the  Raleigh  Belcv;ay  (34). 
Individual  multiple  regression  equations  using  "before  and  after"  data 
were  developed  to  predict  the  following:   (1)  the  number  of  housing  and 
apartment  units,  (2)  the  value  of  industrial  developments,  (3)  commercial 

developments,  and  (4)  office  and  institutional  developments  for  Raleigh, 

2 
North  Carolina.   The  value  of  the  coefficient  of  determination  (R  )  for 

2 
the  developed  equations  were  generally  satisfactory  (R  =  0.7  to  0.9). 

The  models  generally  correlate  the  positive  impact  of  the  Raleigh 

beltway  with  adjoining  land  use,  but  do  not  include  consideration  of 

any  possible  negative  impact  of  the  beltway  on  other  parts  of  the 

Raleigh  community.   The  study  shows  that  some  land  values  close  to  the 

beltway  have  increased  from  $3,000  per  acre  to  $50,000  per  acre.   The 

study  also  states  "land  values  have  gone  down  significantly  in  the  core 

area,  and  surrounding  the  core  area,  the  developments  are  of  very  low 

rent  type  facilities  with  a  high  vacancy  ratio"  (35),  but  this 

phenomenon  was  not  considered  in  the  model  development. 

Interrelationship  of  Transportation  and  Land  Development 
There  were  only  a  few  studies  which  attempted  to  find  the  under- 
lying interrelationship  between  transportation  and  land  development. 
One  such  attempt  was  that  of  Bascom,  et  al,  who  after  a  review  of  the 
literature,  hypothesized  an  approximate  causal  model  compromised  of  a 
set  of  five  simultaneous  regression  equations  to  estimate  "secondary 
effect"  of  investment  in  highways  and  sewers  in  urban  development. 
Unfortunately  they  did  not  calibrate  their  hypothesized  model  but 
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reduced  the  formulated  five  equation  model  to  a  single  equation  model 
for  operational  purposes  (37).   The  data  set,  borrowed  from  calibration 
of  the  "Empiric  land  use  model"  in  Boston,  Denver,  Mlnnenpolis-St .  Paul, 

were  then  used  to  calibrate  the  single  equation  model.   The  calibration 

2 
process  resulted  in  a  low  level  of  explanation  (R  =  0.17  to  0.49). 

The  general  conclusion  was  that  the  effect  of  highways  on  single  family 

residential  construction  was  more  significant  than  originally  assumed 

and  the  effect  of  sewer  investment  was  very  important.   However,  because 

of  unstability  of  model  parameters,  the  model  interpretations  were 

questionable  (37). 

Summary  of  Previous  Transportation  Impact  Studies 
and  Their  Methodology 
A  review  of  literature  of  transportation  impacts  on  land  use  has 
not  produced  any  generalized  causal,  quantitative  tools  for  estimating 
land  use  consequences  of  a  transportation  system  change  on  a  community. 
It  is  believed  the  problem  lies  in  the  fact  that  the  same  "before  and 
after"  methodology  is  used  for  one  transportation  impact  study  after 
another.   Another  basic  problem  has  been  the  general  implicit  assumption 
that  the  effect  of  highways  on  land  use  is  limited  to  a  narrow  corridor. 
As  a  result  of  this  last  assumption,  a  general  increase  in  land  values, 
adjacent  to  the  facility,  has  been  measured  as  a  positive  impact. 
There  has  been  no  attempt  to  measure  the  net  highway  effect  by  assessing 
possible  negative  effects  of  the  facility  elsewhere,  particularly  on 
the  central  business  district.   The  difficulty  with  before  and  after 
methodology  and  lack  of  comprehensiveness  in  impact  studies  will  be 
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discussed  in  the  following  section  and  a  framework  for  an  improved 
alternative  will  be  recommended. 

Analysis  of  Transportation  Impact  Studies  Using 
"Before-and-Af ter"  Methodology 

The  method  of  "bef ore-and-af ter"  analysis  was  used  to  measure  the 
effect  of  a  new  or  improved  highway  on  land  values,  land  use  and  other 
socio-economic  variables.   The  variables  of  interest  were  usually 
measured  two  to  five  years  prior  to  facility  improvement  and  then  again 
two  to  five  years  after  the  change.   It  was  recognized  in  most  studies 
that  there  were  other  influences  besides  transportation  variables  which 
affect  the  socio-economic  variables  of  interest.   Therefore,  a  control 
area  was  chosen  to  isolate  the  effect  of  the  transportation  change  from 
the  effects  of  these  exogenous  variables  on  the  study  area.   In  absence 
of  a  suitable  control  area,  multiple  regression  analysis  was  sometimes 
used  to  relate  one  dependent  variable  to  several  independent  variables 
(38). 

Generally,  an  important  goal  of  transportation  impact  studies  was 
to  determine  the  cause  and  effect  relationship  between  transportation 
development  and  land  development  so  as  to  assist  decision  makers  in 
future  planning  of  transportation  facilities.   "In  truth,  policy 
making  bodies  cannot  have  much  more  ability  to  decide  between  alterna- 
tive transportation  investments  after  reading  these  studies  than 
before"  (39) .   The  problem  was  due  to  "completely  inadequate  theoretical 
foundation  and  methodological  approaches"  (40).   Intuitively,  "before- 
and-after"  techniques  may  seem  appropriate  for  measurement  of  a  trans- 
portation impact,  but  their  use  was  the  one  essential  methodological 
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problem  with  transportation  impact  studies.   Also,  the  associated 
choice  of  control  area  as  an  attempt  to  isolate  the  impact  of  transpor- 
tation change  from  that  of  other  exogeneous  influences  was  misleading. 
The  urban  area  subjected  to  transportation  improvement  was  systematically 
subjected  to  the  particular  transportation  change  because  it  was  in 
some  ways  different  than  other  areas,  including  the  control  area. 

The  use  of  "before-and-af ter"  methodology  to  measure  the  impact  of 
a  new  or  improved  transportation  facility,  at  least  implicitly,  assumes 
that  transportation  change  affords  a  unique  opportunity  for  performing 
an  "experiment"  to  determine  the  effects  of  the  transportation  change 
upon  the  community.   The  above  assumption  was  in  the  tradition  of 
"design  of  experiment".   Without  explicitly  stating  it  a  "before-and- 
after"  study  assumes  that  the  transportation  improvement  was  a  "treat- 
ment", and  that  the  affected  area,  be  it  a  narrow  strip  of  land,  the 
whole  urban  area,  or  an  entire  region,  was  an  "experimental  unit". 
Essential  to  the  methodology  of  "design  of  experiment"  was  the  ability 
to  replicate  the  "experimental  units". 

In  the  physical  sciences  "control"  of  experimentation  was  readily 
available.   In  such  sciences  as  agriculture  and  biology  as  well  as  many 
aspects  of  engineering  and  manufacturing,  "control"  of  experiment  was 
obtained  through  random  assignment  of  treatments  into  replicated 
experimental  units.   Transportation  impact  studies  present  a  greater 
challenge  than  the  above  studies  because  if  is  not  possible  to 
replicate  experimental  units.   An  urban  environment  is  very  hetrogenous 
and  it  is  not  possible  to  divide  urban  areas  into  similar  experimental 
units.   Also,  treatment  is  not  assigned  randomly  to  the  "experimental 
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units",  "for  the  treatment,  the  transportation  change  is  always  de- 
signed to  relate  systematically  to  some  aspects  of  distribution  of 
activities  and  residents"  (41).   In  the  above  discussion  "before-and- 
after"  methodology  is  criticized  on  a  theoretical  basis.   It  is  useful 
to  envision  the  potential  of  learning  from  "before-and-af ter" 
methodologies,  for  the  moment  assuming  that  the  theoretical  difficulties 
are  overcome.   Figures  1  and  2  show  two  possible  scenarios  for  time 
variations  of  one  socio-economic  variable  with  and  without  transporta- 
tion improvement. 

The  scenario  represented  in  Figure  1  yields  more  information  in 
conjunction  with  "before  and  after"  study  than  many  other  possible 
scenarios,  including  the  one  shown  in  Figure  2.   In  these  Figures, 
curve  X  represents  the  time  value  of  one  variable  as  influenced  by  the 
transportation  improvement,  and  curve  Y  represents  the  corresponding 
value  of  the  same  variable  without  the  transportation  improvement. 
Point  C  represents  the  time  at  which  the  transportation  change  took 
place  and  points  B  and  A  represent  times  of  "before"  and  "after"  data 
collection. 

From  measurement  of  the  variable  of  interest  (for  example,  average 
land  value)  before  (B)  and  after  (A)  the  study  area  is  subjected  to  the 
particular  transportation  change,  the  quantity  X  -Y  is  obtained.   In 
the  case  of  the  scenario  presented  in  Figure  1,  the  quantity  ^."Y  is 
positive,  but  unfortunately  this  increase  is  not  attributed  to  the 
change  in  transportation,  because  the  transportation  variable  is 
hopelessly  "confounded"  with  other  variables.   Variables,  besides 
transportation,  which  may  affect  urban  growth  are  numerous.   For 
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Figure  1.   Time  Path  for  an  Impact  Variable  -  Alternative  1 


Source:   Charles  River  Associates,  Inc.,  Measurement  of  the  Effects  of 
Transportation  Chan?;es,  Prepared  for  Department  of  Transporta- 
tion, Sept.  1972. 
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Time 


Figure  2.   Time  Path  for  an  Impact  Variable-  Alternative  2 


Source:   Charles  River  Associates,  Inc.,  Measurement  of  the  Effects  of 
Transportation  Changes,  Prepared  for  Department  of  Transporta- 
tion, Sept.  1972. 
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example,  economic  condition  of  the  region,  population,  educational 
opportunities  and  location  of  the  urban  area  probably  have  significant 
effects  on  the  measured,  X  -Y„,  quantity.   The  effect  of  transportation 
change  on  the  measured  variable  could  have  actually  been  negative,  and 
the  positive  value  measured  may  be  due  to  exogenous  variables. 

To  overcome  the  difficulty  caused  by  "confounding"  of  transporta- 
tion impact  with  other  variables,  many  researchers  have  chosen  a 
"control  area"  in  conjunction  with  "before-and-af ter"  design  to  separate 
the  impact  of  transportation  improvement  from  the  impact  of  other 
variables.   The  "control  area"  is  chosen  for  its  similarity  with  the 
"study  area"  in  all  aspects  except  for  the  fact  that  transportation 
improvement  takes  place  in  the  "study  area",  not  in  the  "control  area". 
It  is  then  assumed  that  exogenous  variables  affect  both  "control  area" 
and  "study  area"  in  an  equal  manner,  and  any  measured  differences 
between  the  value  of  the  variables  of  interest  in  the  "study  area"  and 
the  "control  area"  are  due  to  transportation  change. 

Assuming  for  now,  that  the  proper  "control  area"  as  described  in 

the  above  paragraph  is  selected.   Then  the  magnitude  of  change  in  the 

variable  of  interest,  without  transportation  change  is  measured.   In 

Figure  1,  this  value  is  represented  by  quantity  ""'."'"^u  which  allows 

adjustment  of  the  previously  measured  quantity,  ^  -Y   to  obtain  the 

quantity  X.-Y.  which  is  the  net  change  in  the  variable  at  the  time  A. 
^      -^   A  A 

This  last  quantity  shows  whether  the  transportation  impact  is  negative 
or  positive,  but  it  is  not  a  measure  of  magnitude  of  the  transportation 
impact,  because  transportation  impact  is  represented  by  the  dashed 
area  in  Figure  1. 
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The  scenario  presented  in  Figure  1  is  ideal  for  yielding  informa- 
tion in  conjunction  with  "before-and-af ter"  methodology.   For  example, 
the  situation  presented  in  Figure  2   does  not  allow  measurement  of  the 
impact  of  transportation  improvement  at  all,  because  the  impact  of  the 
transportation  change  was  temporary  and  the  interval  between  transpor- 
tation change  (C)  and  the  observation  (A)  was  too  long  (41)  .   The  above 
analysis  shows  that  the  maximum  amount  that  is  learned  from  traditional 
"before-and-af ter"  studies  with  proper  control  areas  is  whether  the 
transportation  impact  is  positive  or  negative. 

Urban  areas  are  heterogenous.   The  fact  that  the  urban  area  is 
subjected  to  the  transportation  change,  and  the  control  area  is  not,  is 
proof  enough  that  the  two  urban  areas  are  different  from  each  other. 
Two  urban  areas  at  one  point  in  time  seem  to  be  similar  in  some  aspects 
but  most  probably  these  cities  take  diverse  growth  patterns  (42) . 
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CHAPTER  THREE 

A  FRAMEWORK  FOR  DEVELOPING  THE  INTERRELATIONSHIP  OF 

TRANSPORTATION  DEVELOPMENT  AND  CITY  DEVELOPMENT 

The  difficulty  discussed  in  previous  transportation  studies  arose 
due  to  lack  of  "control"  over  the  experimental  conditions.   It  is 
possible  to  learn  about  the  impact  of  transportation  without  the 
performance  of  experiments.   Whereas  in  agriculture  science,  replica- 
tion of  experiment  is  necessary,  in  transportation  impact  analysis: 

"We  must  learn  principally  from  observation  without 
experimentation.   We  are  closer  to  the  situation  of  geo- 
physicists  postulating  and  testing  the  theory  of  continental 
drift  than  to  Fisher  studying,  designing,  and  interpreting 
agricultural  experiments"  (1) . 

"Quasi-experimental"  research  design  allows  learning  from  observa- 
tions wliere  formal  scientific  "design  of  experiment"  is  not  possible. 
"Quasi-experimental"  design  uses  techniques  of  "design  of  experiment" 
to  allow  certain  statistical  control  of  the  phenomenon  under  study 
where  deliberate  randomization  of  "experimental  units"  and  treatments 
are  not  possible.   Campbell  and  Stanley  developed  this  design  for  use 
in  the  field  of  education  (2),  and  Charles  River  Associates,  Inc. 
extended  Campbell  and  Stanley's  finding  to  transportation  impact 
problems. 

Randomization  schemes  in  "design  of  experiment"  allow  tests  of 
significance  for  a  given  treatment  for  the  chosen  "Inference  space". 
The  "quasi-experimental"  designs  are  sampling  schemes  which  allow  choice 
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of  samples  representing  the  "inference  space".   The  variable  of 
interest  is  measured  from  each  chosen  sample,  which  in  turn  allows 
general  statements  in  regard  to  significance  of  the  measured  variable 
over  the  "inference  space". 

In  this  chapter  "quasi-experimental"  design  was  used  to  develop 
an  alternative  framework  to  "before  and  after"  design  for  impact  of 
transportation  facilities.   The  proposed  framework  does  not  measure 
the  change  brought  about  as  a  result  of  a  change  in  transportation,  but 
attempts  to  develop  a  model  to  explain  the  relationship  of  transporta- 
tion development  and  city  development.   The  ability  to  explain  this 
relationship  can,  in  turn,  be  used  to  choose  between  transportation 
alternative  by  predicting  the  consequence  of  implementation  of  a  given 
alternative. 

The  proposed  framework  for  establishing  a  relationship  between 
transportation  development  and  city  development  is  conceptually  simple, 
and  applicable  to  the  study  of  a  wide  range  of  impact  problems. 

The  following  four  steps  are  sequential: 

1.  Development  of  structural  transportation  model(s)  which 
specify  relationships  among  a  set  of  variables  based  on 
explicit  hypothesis. 

2.  Based  on  the  principles  of  "quasi-experimental"  design, 
samples  representing  the  desired  "inference  space"  are  obtained 
and  the  values  of  the  variables  in  the  model  are  measured. 

3.  Measured  variables  are  used  to  calibrate  the  structural  model. 

4.  The  model  is  tested. 
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The  framework  for  development  of  a  model  which  explains  the  relation- 
ship between  transportation  development  and  city  development  is  de- 
tailed in  the  following  sections. 

Use  of  Path-Analysis  for  Verification  of  Relationship  Between 
Transportation  Development  and  City  Development 

The  task  is  to  specify  a  structural  model  which  explains  the 
interrelationship  of  transportation  development  and  city  development. 
Wold  (3)  describes  requirements  for  "explanation"  based  on  observed 
"quasi-experimental"  (or  as  Wold  calls  it,  "non-experimental")  data. 
For  further  clarification,  it  is  useful  to  study  the  position  of 
"explanation"  from  "observed"  data  in  the  realm  of  the  science  of 
applied  statistics.   In  Wold's  terminology,  there  are  two  "means"  and 
two  "ends"  in  applied  statistics  resulting  in  "Four-Fold  Map  of  Ends 
and  Means",  shown  in  Figure  3.   One  of  the  two  "means"  is   statistical 
data,   obtained  from  "experimental"  designs,  and  the  other  is 

statistical  observations,   obtained  from  "non-experimental"  designs. 
The  two  "ends",  or  purposes  of   statistical  inference,   are  "description" 
and  "explanation"  (A). 

Clarification  of  the  distinction  between  descriptive  statistics  and 
explanatory  statistics  is  useful.   Descriptive  statistics,  which  are 
limited  to  the  northern  sector  of  Figure  3,  deal  with  the  questions  of 
"How"?  or  "How  much"?,  while  explanatory  statistics  limited  to  the 
southern  half  "treat  the  question  'Why'?  and  ...  the  answer  involves 
an  element  of  causal  inference"  (5).   Descriptive  statistics  is 
sufficient  to  maintain  control  over  current  operations,  but  for  improve- 
ment of  the  operations,  explanatory  statistics  are  required.   A  good 
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Figure  3.   Four  Fold  Map  of  Ends  and  Means  in  Applied  Statistics 

Source:   Wold,  Herman  "Causal  Inference  from  Observational  Data:   A 
Review  of  Ends  and  Means",  Journal  of  Royal  Statistical 
Society,  Sec.  A.,  1956,  pp.  28-51. 


24 


interpretation  of  distinction  is  credited  to  Dr.  R.  Porter  "...  descrip- 
tion is  employed  as  an  aid  in  tlie  human  'adjustment'  to  given  condition, 
while  explanation  is  a  vehicle  for  'ascendancy'  over  the  environment" 

(6). 

Prior  to  the  20th  century,  statistics  was  largely  limited  to  a 

description  based  on  "non-experimental"  data,  while  the  classic  and 

modern  statistics  developed  in  this  century  was  largely  interested  in 

description  and  explanation  by  use  of  "experimental"  data  (NE  and  SE 

sectors)  (7).   According  to  Wold: 

"The  epoch-making  contributions  of  R.  A.  Fisher  brought 
a  powerful  expansion  in  the  direction  of  experimental 
methods,  with  emphasis  in  the  SE  sector.   The  exact  and 
efficient  methods  developed  by  Fisher  and  his  followers 
have  led  to  great  triumphs  in  biological  sciences,  medicine, 
engineering  and  other  fields  where  research  can  be  based  on 
controlled  experiments..."  (8). 

The  progress  in  the  SW  sector  is  slow  because  "explanation"  on  the 

basis  of  "quasi-experimental  data"  is  more  difficult  than  "explanation" 

based  on  "experimental  data".   Unlike  the  progress  in  the  SE  section, 

brought  by  Fisher's  formalized  and  generalized  statements  applicable 

to  all  fields,  the  progress  in  the  SW  section  has  to  come  separately 

for  each  field.   With  "non-experimental"  data  "the  statistical  analysis 

has  to  be  closely  coordinated  with  the  subject  matter,  both  in 

specifying  the  causal  hypothesis,  and  testing  them  against  other  sources 

of  knowledge"  (9).   The  specification  of  the  causal  model  is  based  on 

subject  matter  theory  to  "test  or  demonstrate  the  causal  assumptions" 

(10). 

The  general  structural  model  chosen  in  this  research  project  to 

explain  the  relationship  of  transportation  and  city  development  was  the 
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"path  model".   Path  analysis  was  first  developed  by  geneticist  Wright 
in  1918,  and  popularized  in  sociology  by  Duncan  in  1966  (11).  Path 
analysis  was  a  useful  tool  in  the  applied  statistics  where  the  objective 
was  "explanation"  from  "quasi-experimental"  data  (SW  sector  of  Figure  3) . 
The  path  model,  as  normally  used,  is  a  system  of  recursive  regression 
equations.   There  are  several  advantages  in  the  use  of  the  path  model. 
One  advantage  of  path  analysis  is  that  the  assumptions  made  are 
explicit,  which  in  turn  forces  the  discussion  to  be  internally  con- 
sistent, so  that  incompatible  assumptions  are  not  incorporated  within 
the  model.   Another  advantage  of  the  path  model  is  that  effects  of  one 
variable  on  another  variable  are  divided  into  direct  and  indirect 
effects.   This  in  turn  allows  a  measure  of  the  total  effect  of  one 
variable  on  another. 

Path  analysis,  as  developed  by  Wright,  is  concerned  with  linear, 
additive,  and  causal  relationships  among  a  set  of  variables  which  are 
measured  by  an  "interval  scale".   Although  the  occasional  use  of  a 
dummy  variable  is  allowed,  "the  hypothetical  variables  (dummy  variables) 
are  conceived  as  continuous  variables"  (12).   The  assumption  of  a 
causal  relationship  is  the  only  assumption  not  required  of  the  usual 
statistical  model.   "Causal  relationship"  in  path  analysis  means  that 
a  "weak  causal  order"  exists  between  the  set  of  variables,  and,  the 
relationship  among  these  variables  are  "causally  closed".   "Weak  causal 
order"  between  variables  X  and  Y  (X-»-Y)  means  that  X  can  affect  Y,  but 
Y  cannot  affect  X.   By  "causal  closure",  it  is  meant  that  given  a  bi- 
variate  covariation,  and  weak  causal  ordering  between  X  and  Y,  then  the 
observed  covariation  between  X  and  Y  is  due  (1)  to  causal  dependence  of 
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Y  on  X,  (2)  to  the  mutual  dependence  of  X  and  Y  on  other  variable(s), 
or  (3)  to  the  combination  of  the  above  two  causes  of  variations  (13). 

The  diagrammatic  representation  of  the  set  of  structural  equations 
which  is  based  on  the  above  assumptions  is  called  a  "path  diagram". 
A  "path  diagram"  is  useful  in  understanding  the  assumptions  made  in  the 
"path  model",  because  there  is  a  one  to  one  relationship  between 
statistical  and  algebraic  properties  of  the  structural  equations  and 
the  "path  diagram"  ( 1^) .      The  following  four  conventions  are  used  for 
presentation  of  a  "path  diagram": 

1.  Uni-directional  straight  arrows  identify  "weak  causal 
ordering"  of  variables.   Straight  arrows  are  drawn  from  each 
"determining  variable"  to  each  variable  dependent  on  it. 

2.  The  non-causal  correlated  relationship  between  two  independent 
variables  is  shown  with  a  two  headed  curvilinear  arrow. 

3.  The  portion  of  variation  of  any  dependent  variable  not  ex- 
plained within  the  system  is  represented  with  a  "residual 
variable".   The  "residual  variables"  are  also  shown  with  a 
unidirectional  arrow. 

4.  Beside  each  unidirectional  arrow  the  value  of  "path  coefficient" 
(P..)  is  entered.   The  first  subscript  j  denotes  the  dependent 
variable  and  the  second  subscript,  i,  represents  the  independent 
variable. 

A  simple  example  of  the  "path  model"  of  Figure  Ua   is  offered  for 
visualization  of  the  above  conventions,  and  to  demonstrate  the  technique 
of  "path  analysis".   The  "path  estimation  equations"  are  derived  from 
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Figure  Aa.   Example  of  Path  Model  with  Unnormalized  Variables  and 
Path  Regression  Coefficients 


Figure  4b.   Example  of  Path  Model  with  Normalized  Variables  and 
Path  Coefficients 
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the  structural  equations  of  the  "path  model"  by  use  of  the  "path 
analysis"  technique  (15).   The  example  also  shows  how  the  total  effect 
of  an  independent  variable  on  a  dependent  variable  is  obtained.   The 
general  method  used  in  conjunction  with  solution  of  tlie  four  variable 
model  of  Figure  4a  is  directly  applicable  to  the  much  larger  model, 
which  is  used  in  this  research.   But  the  solution  of  this  small  model 
more  conveniently  demonstrates   tlie  technique  of  "path  analysis". 

Given  the  four  variables  X  ,  X  ,  X  ,  X,,  and  the  "path  model"  of 
Figure  4a,  the  structural  equations  are: 

X^  =  K^  ,  X,  +  K„  „  X„  +  K^    E.  eq.  la 

3  3,11    J,/z    3,eJ 

X,  =  K,  ,  X,  +  K.  „  X„  +  K.  „  X„  +  K,  ^  E  eq .  lb 

4  4,11    4,2  2    4,3   3    4,e4 

The  K  coefficients  are  known  as  path  regression  coefficients,  and 

are  obtained  by  the  ordinary  method  of  regression  equation.   For 

example,  the  value  of  K_  ^  is  the  same  as  the  regression  coefficient 

J  >  -i 

obtained  by  regressing  X_  on  X^ ,  when  X„  is  already  in  the  equation. 

This  last  statement  in  regression  terminology  is  written  as  3o-|  o* 

If  the  mean  and  variance  of  the  variable  X.  are  M.  and  a.,    then 

111 

Z.  =  (X.-M.)/a.  represents  the  variable  in  the  form  of  its  "standard 
1     11   1 

score".   "Path  coefficient".  P..,  between  two  standardized  variables 
with  subscripts  i  and  i  is  defined  as  P .  .  =  K.  .(o/o.).      Then  by 
substitution  equation  la  and  lb  are  transformed  to  equation  2a,  and  2b. 


^  =  ^3.1  ^1-^^3,2  h-''2,e   h  '^^^  '" 


Z.  =  P.    7,  +P,  .  Z„  +  P.  ^  Z_  +  P,    E.  eq.  2b 

4    4,11    4,2   2    4,3   3    4,e4 
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The  actual  values  of  the  "path  coefficient"  is  calculated  in  the  same 
way  as  path  regression  coefficients,  but  the  variables  are  in 
standardized  form.   Figure  4b  represents  the  path  model  with  standard- 
ized variables  and  path  coefficients.   The  value  of  residuals  for 


var 


/   2         2 
iable  Z.  is  /1-R  ;  where  R  is  the  value  of  the  "coefficient  of 


determination"  obtained  when  Z.  is  the  dependent  variable,  and  all 
independent  causal  variables  are  in  the  equation. 

Path  coefficients  are  statistically  tested  in  accordance  with  back- 
ward elimination  method.  In  case  the  value  for  one  of  the  coefficients  is 
negligible  the  corresponding  path  is  removed  from  the  "path  diagram",  and 

the  process  is  repeated  until  all  the  remaining  path  coefficients  are 
significant. 

Total  Effect  of  an  Independent  Variable  on  a  Dependent  Variable 

To  obtain  the  "total  effect"  of  variable  Z.  on  variable  Z.  within 

1  J 

the  "path  model",  let's  determine  equations  of  type: 


Z.  =  C.  .  Z.  eq.  3 

C.  is  a  constant  standing  for  magnitude  of  change  in  Z.  for  a 
unit  change  (i.e.  a  one  standard  deviation  change)  in  Z . ,  while  con- 
trolling for  extraneous  causes.   For  the  path  model  of  Figure  4b: 


S.l  =  ^3,1  ^^-  ^" 


S,2  =  ^3,2  ^^-  ^^ 


^,2  =  ^4,2-^^,2^4,3  ^^-  ''' 

^4.1  =  ^4,1-^^3.1^4,3  ^^-  '^ 
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The  value  of  C,  „  obtained  In  equation  Ad,  represents  the  total 

effect  of  variable  Z„  on  variable  Z,.   This  total  effect  is  a  summation 

2  4 

of  the  direct  effect  of  variable  Z„  on  variable  Z,  and  an  indirect 

2  4 

effect  of  variable  Z   on  variable  Z  ,  thru  variable  Z  .   The  interpreta- 
tion of  the  term  C,  „  is  that  for  one  standard  deviation  change  in 

variable  X„ ,  variable  X,  is  changed  C,  „  standard  deviation.   The  con- 
2  4  4,2 

cept  is  very  helpful  in  a  more  complex  model  where  there  are  many  in- 
direct effects,  making  intuitive  assessment  of  total  effects  almost 
impossible. 

The  above  explanation  of  "path  analysis"  is  a  mechanical  descrip- 
tion of  the  subject  which  is  highly  dependent  on  subject-matter 
assumptions  for  its  usefulness.   In  the  following  paragraphs,  a  "path 
model"  is  hypothesized  to  explain  the  relatlonsliip  between  transporta- 
tion and  city  development.   In  the  following  chapters  the  method  of 
"path  analysis"  is  incorporated  in  a  quasi-experimental  design  to 
calibrate  the  hypothesized  model. 

Path  Model  Hypothesized  to  Explain  Relationship  Between 
Transportation  Development  and  City  Development 

Development  of  a  "path  model"  to  explain  the  relationship  between 
transportation  development  and  city  development  depends  upon  subject 
matter  theory  and  assumptions. 

The  most  essential  assumption  in  the  path  model  developed  in  this 
project  is  that  transportation  development  influences  city  development. 
This  assumption  is  based  on  transportation  "supply  and  demand"  theory, 
which  states  in  the  long  run  the  transportation  improvement  is  influenced 
by  major  investment  decision  making. 
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Transportation  "supply  and  demand"  theory  which  is  based  on  wel- 
fare economics  is  discussed  elsewhere  (16) .   A  simplified  explanation 
of  this  theory  with  help  of  the  diagrammatic  presentation  of  Figure  5 
is  given  here.   Curves  DD  and  SS  represent  the  initial  demand  and  supply 
functions.   Tlic  Initial  equilibrium  is  at  Point  E  ;  with  a  trip  volume 
of  T  and  corresponding  trip  cost  of  C  .   A  transportation  improvement 
shifts  the  supply  function  to  the  right,  the  the  S  S  position.   The 
shift  of  the  supply  function  to  the  right  if  the  demand  for  travel 
remains  the  same  as  the  pre-improvement  period  causes  a  much  lower 
travel  cost  C„,  but  this  is  an  unstable  state  and  shortly  after  the 
improvement  drivers  divert  from  other  routes  to  take  advantage  of  the 
lower  travel  cost  in  the  improved  route  until  equilibrium  E,  with 
corresponding  trip  volume  T.  and  trip  cost  C^  is  reached.   In  the  long 
run  though,  due  to  lower  travel  cost  C  ,  investment  decisions  are  made 
to  relocate  or  initiate  nevr  construction  to  take  advantage  of  the  im- 
proved transportation  facility.   These  investment  decisions,  in  turn, 
cause  the  demand  curve  to  shift  to  the  right  which  means  a  higher  travel 
cost  for  a  given  trip  volume.   Eventually  the  equilibrium  point  between 
a  new  supply  function  and  a  new  demand  function  is  reached  at  point  E 
with  the  corresponding  trip  volume  T  and  cost  C.   The  consequence  of 
this  theory  is  that  transportation  development  influences  city  develop- 
ment. 

The  path  model  developed  in  this  research  project  is  based  on  the 
eight  specific  assumptions  listed  in  Table  1.   Assumptions  one  and  two 
partially  follow  from  transportation  supply  and  demand  theory. 
Assumptions  three  thru  eight  are  not  based  on  any  one  specific  source. 
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Figure  5.   Diagrammatic  Presentation  of  Transportation  Supply  and 
Demand  Theory 
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Table  1.   Assumptions  Used  in  Development  of  the  Path  Model  Describing 
the  Relationship  of  Transportation  Development  and  City 
Development 


No.  Description  of  Assumption 

1  City  location  and  inter-city  transportation  system 

influence  intra-city  transportation  development. 

2  Intra-city  transportation  development  influence 

industrial  development. 

3  Industrial  development  influences  employment. 

4  Employment  influences  mean  family  income. 

5  Mean  family  income  influences  total  population. 

6  City  population  influences  commercial  land  use. 

7  Vacant  land  use  influences  institutional  land  use. 

8  Institutional  land  use  influences  residential  land 

use. 
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but  come  from  the  author's  understanding  of  the  cities  growth  process; 
although  assumptions  three  to  five  are  close  to  assumptions  made  in 
"Lowery"  type  urban  development  models  which  assume  basic  employment 
in  the  long  run  determines  city  population  (17) . 

The  path  model  based  on  the  assumptions  of  Table  1  is  shown  in 
Figures  6a  and  6b.   The  conventions  used  in  these  diagrams  are  the  same 
as  those  explained  in  conjunction  with  Figures  4a  and  Ab .   Figure  6a 
which  corresponds  with  Figure  Aa  shows  assumed  relationship  between  the 
actual  variables  and  the  associated  path  regression  coefficients;  while 
Figure  6b  shows  the  normalized  variables  and  the  path  coefficients. 
The  original  form  of  the  path  model  is  more  convenient  for  the  verifica- 
tion of  the  assumed  links  and  prediction  of  various  land  uses;  while  the 
second  form  of  tlie  model  is  more  convenient  for  calculation  of  the  over- 
all effects  of  one  variable  on  another. 

The  operational  form  of  the  path  model  corresponding  with  the  path 
diagram  of  Figure  6a  is  shown  by  the  nine  sequential  regression  equations 
6a  to  6h  shown  in  Figure  7a.   The  coefficient  of  these  equations  then 
are  path  regression  coefficients  which  upon  verification  are  used  to 
predict  the  endogenous  variables.   The  mathematical  form  of  the  path 
model  with  normalized  coefficients  and  path  coefficients  is  presented 
by  sequential  regression  equations  7a  to  7h,  shown  in  Figure  7b.   After 
verification,  these  path  coefficients  are  used  to  determine  the  overall 
effect  of  causal  variables  on  dependent  variables.   If  any  of  the  path 
coefficients  prove  to  be  non-significant,  then  the  corresponding  link 
on  the  path  diagrams  of  Figures  6a  and  6b  is  erased  and  the  resulting 
new  model  analyzed  for  further  verification. 
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Table  2.   Designation  and  Names  of  Variables  Used  in  the  Transportation- 
Urban  Development  Path  Model 


Variable  Designation 


Variable  Name 


City  location  * 
Inter-city  accessibility  * 
Intra-city  accessibility  * 


Industrial  land  use  ** 
Employment  per  1000  persons  * 
Median  family  income  * 


City  population  * 
Commercial  land  use  ** 
Vacant  land  use  ** 


10 


11 


Institutional  land  use  ** 
Residential  land  use  ** 


*  exogenous  variables 
**endogenous  variables 
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The  transportation-urban  development  path  model  contains  11 
variables.   These  variables  are  listed  in  Table  2  for  purposes  of 
identification  of  model  variables.   Detailed  definitions  of  these 
variables  are  sensitive  to  the  specific  data  measurement  techniques. 
These  11  variables  as  used  in  this  study  are  further  defined  in  the  data 
collection  section. 

It  should  be  noted  at  this  time  that  these  variables  fall  into 
two  categories  of  endogenous  and  exogenous  variables.   It  is  the  purpose 
of  the  model  to  explain  the  variations  of  endogenous  variables;  while 
the  values  of  the  exogenous  variables  are  assumed  to  be  known.   The 
purpose  of  inclusion  of  the  exogenous  variables  in  the  model  is  to 
assist  in  explanation  of  the  variations  of  the  endogenous  variables. 
The  purpose  of  this  model  is  to  explain  variations  of  various  land  uses 
for  a  given  socio-economic  and  transportation  system.   Therefore 
residential  land  use,  institutional  land  use,   vacant  land  use, 
commercial  land  use,  and  Industrial  land  use  are  the  real  dependent 
(endogenous)  variables  and  the  city  location  inter-city  accessibility, 
intra-city  accessibility,  employment  rate,  median  family  income,  and 
population  are  really  the  set  of  independent  (exogenous)  variables  to 
assist  in  the  explanation  of  variations  of  the  dependent  variables. 
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CHAPTER  FOUR 
DESIGN  OF  QUASI-EXPERIMENT 

The  design  of  this  quasi-experiraent  involves  observations  and 
statistical  analysis  of  four  series  of  cross-sectional  data  to  test  a 
hypothesized  relationship  between  transportation  development  and  urban 
development.   In  a  traditional  design  of  experiment  the  significance 
of  a  treatment  is  obtained  by  subjecting  only  some  of  the  experimental 
units  to  a  given  treatment  and  then  comparing  the  experimental  units 
subjected  to  the  treatment  with  the  experimental  units  not  subjected 
to  the  treatment  to  test  for  significance  of  the  particular  treatment. 
In  the  quasi-experimental  design  such  control  of  treatment  and 
experimental  units  is  not  possible,  but  the  control  is  obtained  by 
randomly  sampling  an  experimental  unit  and  then  measuring  the  unique 
treatments  observed  within  the  sampled  experimental  unit.   This 
sampling  method  is  very  useful  for  perspective  studies  of  urban 
phenomenon  because  it  does  not  require  a  prior  categorization  of  cities 
as  to  different  treatment,  but  allows  for  observations  of  actual  treat- 
ments after  random  selection  of  the  city  as  the  experimental  unit.   The 
method  still  allows  stratification  of  sample  universe  into  data  cells 
for  other  than  non-treatment  variables.   The  sampling  scheme  for  choice 
of  cities  to  be  observed  for  one  cell  thru  time  is  shown  in  Figure  8. 
The  nomenclature  used  in  this  figure  is  that  of  Stanley  and  Campbell  (1) 
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Figure  8.   Sampling  Scheme  for  Calibration  of  Transportation-City 
Development  Path  Model 
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Each  line  of  this  figure  represents  the  process  by  which  one  city  is 

selected  and  observed.   Letter  R  signifies  the  randomness  of  the 

process.   Letter  X   signifies  that  this  city  is  subjected  to  a  unique 

set  of  treatments  (X  ) ,  and  0   signifies  that  the  particular  treat- 

A  A. 

raents,  X.,  are  observed  and  measured. 
A 

Procedure  to  Select  Sample  Cities 
The  "quasi-experimental"  design  chosen  to  obtain  observed  data  to 
calibrate  the  transportation-urban  development  path  model  of  equations 
6a  to  6h  is  "cross-sectional"  sampling.   Samples  of  all  Indiana  cities 
in  the  inference  space,  in  random  within  strata  of  thirty-six,  are 
selected,  and  then  values  of  the  eleven  variables  in  the  path  model  for 
each  of  the  sample  cities  are  measured. 

As  a  means  of  obtaining  cross-sectional  samjjles  of  cities  on  the 
inference  space,  the  cities  are  stratified  across  four  time  periods, 
throe  geographic  locations,  and  three  population  sizes.   The  resulting 
36  cells;  nine  in  eacli  of  the  four  time  blocks  are  shown  in  Figure  9. 
Random  sampling  of  cities  in  each  of  these  36  cells  and  aggregation  of 
the  selected  cities  in  each  of  the  nine  cells  of  a  time  block  results 
in  the  four  lists  of  independent  cross-sectional  samples. 

There  were  several  reasons  for  sampling  cities  in  1940,  1950,  1960 
and  1970  periods.   The  first  sampling  period  of  1940  was  chosen  because 
for  smaller  cities  some  socio-economic  variables  first  became  available 
in  the  1940  census  publications.   Also,  1940  was  prior  to  the  accelerated 
move  of  city  dwellers  to  suburbia,  as  well  as  relatively  heavy  demand  on 
and  investment  for  the  transportation  system  brought  about  by  the  second 
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world  war.   Choice  of  decinal  years  was  to  enable  direct  use  of  Bureau 
of  Census  decinal  socio-economic  data  in  the  model  calibrations. 

The  choice  of  1970  as  the  last  sampling  period  was  due  to  difficulty 
of  obtaining  uniform  data  for  any  later  date;  as  well  as  the  desire  for 
the  same  time  differences  between  the  sampling  periods. 

Because  of  the  assumption  that  city  location  was  a  factor  in  the 
city  development  the  State  of  Indiana  was  stratified  into  three  zones 
to  assure  representative  city  samples  from  different  geographic 
locations.   Northern  Indiana  was  prime  agricultural  land  and  manu- 
facturing more  so  than  the  more  hilly  southern  section.   Generally  it 
is  assumed  that  the  dividing  line  between  north  and  south  is  U.S.  40 
running  thru  Indianapolis.   But  it  was  decided  to  choose  an  intermediate 
central  zone  to  provide  a  gradual  transition  between  the  northern  and 
southern  zone.   Also,  central  Indiana  is  unique  because  it  includes 
Indianapolis  which  is  the  state's  largest  city  and  a  nationally  unique 
transportation  hub  for  a  city  of  its  size;  in  part  because  of  the  four 
interstate  highways  passing  thru  the  city.   The  exact  boundary  of  the 
three  locational  zones  follow  the  boundary  lines  of  planning  and  devel- 
opment regions  published  by  the  Indiana  State  Planning  Services  Agency 
(2)   as  shown  in  Figure  10.   Aggregation  of  several  counties  into  a 
development  region  implies  that  the  counties  have  in  common  similar 
resources  and  planning  goals.   If  the  city  location  in  this  experiment 
proved  significant,  then  each  of  the  three  geographical  zones  could  be 
further  disaggregated  to  the  level  of  Indiana  planning  and  development 
regions . 


46 


Figure  10.   Division  of  State  of  Indiana  Into  Three  Zones 


Source  for  the  base  map  showing  Indiana  Planning  and  Development 
Regions:   (Indiana)  State  Planning  Services  Agency,  "A  Directory  of 
Indiana  Local  and  Regional  Planning  and  Development  Commissions", 
Indianapolis,  June  1975,  p.  5. 
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The  sample  space  was  further  stratified  in  accordance  to  the 
cities  population  size.   It  has  been  already  explained  that  the  scope 
of  the  experiment  was  limited  to  cities  with  less  than  50,000  popula- 
tion (not  located  in  any  SMSA  county)  and  greater  than  2,500.   The 
cities  between  2,500  and  50,000  were  further  stratified  to  three  levels 
of  2,500-10,000;  10,001-20,000;  20,001-50,000.   Stratification  according 
to  population  was  necessary  because  there  were  many  more  smaller  cities 
than  the  larger  ones  and  without  stratification  an  unbiased  cross- 
sectional  sample  could  not  be  obtained.   The  particular  boundary  line 
of  20,000  was  chosen  to  assure  a  minimum  of  one  city  in  each  cell. 

The  specific  sampling  procedure  required  identification  of  all 
cities  in  the  inference  space  and  listing  them  in  their  proper  cell 
prior  to  random  sampling.   As  a  first  step  in  determining  the  cities 
in  the  inference  space  all  cities  within  Standard  Metropolitan 
Statistical  Areas  (SMSA)  were  omitted  from  the  sample  space  to  conform 
to  the  scope  of  the  project.   The  excluded  and  eligible  counties  for 
the  four  sampling  periods  are  shown  in  Figures  11  to  lA  adopted  from 
census  publications  (3,4,5,6). 

It  is  noted  that  the  Census  Bureau  in  1940  did  not  designate 
counties  as  metropolitan  areas,  but  the  cities  of  South  Bend,  Fort 
Wayne,  Indianapolis,  Terre  Haute,  and  Evansville  were  designated  as 
metropolitan  areas  (7)  and  for  consistency  all  cities  in  the  counties 
containing  these  metropolitan  areas  were  excluded  from  this  study. 

The  eligible,  non-SMSA,  counties  in  each  of  the  Figures  11  to  14 
were  selected  one  at  a  time  and,  by  use  of  the  Bureau  of  Census  data, 
(3,4,5,6)  all  the  cities  with  greater  than  2,500  populations  were 
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D 


Counties  containing  cities 
Designated  as  Metropolitan 
areas . * 


Figure  11.   Counties  Excluded  (Containing  Cities  Designated  as  Metro- 
politan Areas)  from  1940  Data 


bource: 


U.S.  Bureau  of  Census.   Census  of  Population:   1940,  Vol.  II, 
Characteristics  of  the  Population,  Part  2  Florida-Iowa.   U.S. 
Government  Printing  Office,  Washington,  D.C.  ,  p.  674  (Base  map) 
and  pp.  842-844  (Designation  of  Metropolitan  Areas)'^. 
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Figure  12.   Counties  Excluded  (Standard  Metropolitan  Areas)  from  1950  Data 


Source:   U.S.  Bureau  of  Census.   U.S.  Census  of  Housing:   1950  Vol.  I, 
General  Characteristics,  Part  3,  Idaho-Massachusetts,  U.S. 
Government  Printing  Office,  Washington,  D.  C,  1953. 
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Figure  13.   Counties  Excluded  (Standard  Metropolitan  Statistical  Areas) 
from  1960  Data 


Source:   U.S.  Bureau  of  Census.   U.S.  Census  of  Population:   1960,  Vol.  I, 
Characteristics  of  the  Population.   Part  16,  Indiana.   U.S. 
Government  Printing  Office,  Washington,  D.  C,  1963. 


INDIANA 
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Counties,  Standard  Metropolitan  Statistical  Areas,  and  Selected  Places 
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Figure  14.   Counties  Excluded  (Standard  Metropolitan  Statistical  Areas) 
from  1970  Data 


Source:   U.S.  Bureau  of  Census.   U.S.  Census  of  Population:  1970,  Vol.  1, 
Characteristics  of  Population.   Part  16,  Indiana,  U.S.  Govern- 
ment Printing  Office,  Washington,  D.  C,  April  1973. 
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listed  in  one  of  the  three  population  levels  as  shown  in  Figure  9.   The 
lists  of  cities  in  each  county  were  aggregated  to  the  level  of  one  of 
the  18  Indiana  planning  and  development  regions  shown  in  Figure  10. 
This  in  turn  allowed  further  aggregation  of  the  cities  in  the  sample 
space  to  the  level  of  one  of  the  three  geographical  locations  for  each 
of  the  four  time  periods.   Each  of  the  Tables  3  to  14  represent  the  re- 
sults of  one  such  aggregation  containing  three  cells.   After  a  decision 
as  to  the  percentage  of  sample  space  to  be  studied  each  of  the  36  cells 
in  these  tables  were  randomly  sampled  to  yield  the  list  of  cities  to 
be  investigated. 

The  appropriate  percent  of  population  to  be  sampled  depends  on 
variance  of  variables  to  be  measured.   The  variance  of  the  11  variables 
of  interest  in  this  study  were  not  known.   In  such  a  situation,  multi- 
stage experimentation  normally  was  thought  to  be  helpful,  but  in  this 
case  the  potential  for  instrumentation  error  was  felt  to  be  sufficient 
to  encourage  a  single  stage  sampling.   Table  15  which  is  based  on 
Tables  3  to  14  shows  the  number  of  cities  in  the  sample  space  for  each 
of  the  36  cells.   There  are  a  total  of  363  cities  in  the  sample  space. 
Based  on  the  project's  resources  and  the  desire  for  a  satisfactory 
single  stage  sampling;  an  overall  25  percent  sample  was  chosen.   But 
each  of  the  cells  were  not  sampled  equally,  because  there  were  many 
more  smaller  cities  than  larger  cities,  and  with  equal  percent  sampling 
of  each  cell  the  experiment  would  have  been  biased  against  larger 
cities. 
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Table  3.   List  of  19A0  Cities  in  the  Sample  Space  -  Northern  Zone 


Urban 

Planning 

and  Devel- 

City Population 

opment 

Region* 

2,500-10,000 

10,001-20,000 

20,001-50,000 

No. 

lA 

Valparaiso 

1 

IB 

Rensselaer 

2 

2 

Plymouth 

3 

2 

LaPorte 

4 

2 

Michigan  City 

5 

2 

Warsaw 

6 

2 

Elkhart 

7 

2 

Goshen 

8 

2 

Napanee 

9 

3A 

Columbia  City 

10 

3A 

Angola 

11 

3A 

Kendallville 

12 

3A 

Huntington 

13 

3B 

Decatur 

14 

3B 

Auburn 

15 

3B 

Garrett 

16 

3B 

Bluffton 

17 

4 

Monticello 

18 

4 

Lafayette 

19 

4 

W.  Lafayette 

20 

4 

Crawfordsville 

21 

4 

Attica 

22 

4 

Frankfort 

23 

5 

N.  Manchester 

24 

5 

Wabash 

25 

5 

Tipton 

26 

5 

Peru 

27 

5 

Kokomo 

28 

5 

Rochester 

29 

5 

Logansport 

30 
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Table  4. 

List  of  1940  Cities 

!  in  the  Sample  Space  -  Central  Zone 

Urban 

Planning 

and  Devel- 

City Population 

opment 

Region* 

2,500-10,000 

10,001-20,000    20,001-50,000 

No. 

6 

Union  City 

1 

6 

Winchester 

2 

6 

Alexandria 

3 

6 

Anderson 

4 

6 

Elwood 

5 

6 

Dunkirk 

6 

6 

Portland 

7 

6 

New  Castle 

8 

6 

Fairmount 

9 

6 

Marion 

10 

6 

Muncie 

11 

6 

Hartford  City 

12 

7 

Clinton 

13 

7 

Sullivan 

14 

7 

Green  Castle 

15 

7 

Brazil 

16 

8 

Shelbyville 

17 

8 

Martinsville 

18 

8 

Franklin 

19 

8 

Noblesville 

20 

8 

Greenfield 

21 

8 

Lebanon 

22 

9 

Richmond 

23 

9 

Rushville 

24 

9 

Connersville 

25 

Table  5.   List  of  1940  Cities  in  the  Sample  Space  -  Southern  Zone 
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Urban 
Planning 
and  Devel- 
opment 
Region* 


City  Population 


2,500-10,000    10,001-20,000    20,001-50,000 


No, 


10 

11 

11 

11 

11 

12 

12 

12 

12 

13A 

13A 

13A 

ISA 

13A 

13A 

13A 

13B 

13B 

13B 

13B 

14 

15 

15 

15 

15 


Bloomington 


North  Vernon 

Seymour 

Greensburg 

Batesville 
Madison 
Aurora 
Lawrenceburg 

Mitchell 
Bicknell 

Jasonville 

Linton 

Washington 

Booneville 

Oakland 

Princetown 

Petersburg 

Mt .  Vernon 

Salem 

Tell  City 

HuTitingburg 

Jasper 


Columbus 


Bedford 


Vincennes 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 


56 


Table  6.   List  of  1950  Cities  in  the  Sample  Space  -  Northern  Zone 


Urban 

Planning 

and  Devel- 

City Population 

opment 

Region* 

2,500-10,000 

10,001-20,000 

20,001-50,000 

No. 

lA 

Chesterton 

1 

lA 

Valparaiso 

2 

IB 

Knox 

3 

IB 

Rensselaer 

4 

2 

Bremen 

5 

2 

Plymouth 

6 

2 

LaPorte 

7 

2 

Michigan  City 

8 

2 

Warsaw 

9 

2 

Elkhart 

10 

2 

Goshen 

11 

2 

Napanee 

12 

3A 

Columbia  City 

13 

3A 

Angola 

14 

3A 

Kendallville 

15 

3A 

Huntington 

16 

3B 

Decatur 

17 

3B 

Auburn 

18 

3B 

Garrett 

19 

3B 

Bluf fton 

20 

4 

Monticello 

21 

A 

Lafayette 

22 

4 

W.  Lafayette 

23 

4 

Crawfordsville 

24 

4 

Delphi 

25 

4 

Attica 

26 

4 

Frankfort 

27 

5 

N.  Manchester 

28 

5 

Wabash 

29 

5 

Tipton 

30 

5 

Peru 

31 

5 

Kokomo 

32 

5 

Rochester 

33 

5 

Logansport 

34 
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Table  7. 

List  of  1950  Cities 

1  in  the  Sample  Space  -  Central  Zone 

Urban 

Planning 

and  Devel- 

^^  f^  TVt  d^^  f^  ^ 

City  Population 

opnient 

Region* 

2,500-10,000 

10,001-20,000    20,001-50,000 

No. 

6 

Union  City 

1 

6 

Winchester 

2 

6 

Alexandria 

3 

6 

Anderson 

4 

6 

Elwood 

5 

6 

Dunkirk 

6 

6 

Portland 

7 

6 

New  Castle 

8 

6 

Fairmount 

9 

6 

Gas  City 

10 

6 

Marion 

11 

6 

Hartford  City 

12 

7 

Clinton 

13 

7 

Sullivan 

14 

7 

Greencastle 

15 

7 

, Brazil 

16 

8 

Shelbyvllle 

17 

8 

Martinsville 

18 

8 

Edinburg 

19 

8 

Franklin 

20 

8 

Greenwood 

21 

8 

Noblesville 

22 

8 

Greenfield 

23 

8 

Danville 

24 

8 

Plainfield 

25 

8 

Lebanon 

26 

9 

Cambridge  City 

27 

9 

Rushville 

28 

9 

Richmond 

29 

9 

Brookville 

30 

9 

Connersville 

31 
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Table  8.   List  of  1950  Cities  in  the  Sample  Space  -  Southern  Zone 


Urban 
Planning 
and  Devel- 
opment 
Region* 


City  Population 


2,500-10,000    10,001-20,000 


20,001-50,000 


No, 


10 

11 

11 

11 

11 

12 

12 

12 

12 

13A 

13A 

13A 

13A 

13A 

13A 

13A 

13B 

13B 

13B 

13B 

13B 

14 

14 

14 

15 

15 

15 

15 

15 


Bloomington 


North  Vernon 

Seymour 

Greensburg 

Batesville 
Madison 
Aurora 
Lawrenceburg 

Mitchell 
Bicknell 

Jasonville 
Linton 

Booneville 

Oakland  City 

Princetown 

Petersburg 

Mt.  Vernon 

Salem 

Austin 

Scottsburg 

Kockport 

Tell  City 

Paoli 

Huntingburg 

Jasper 


Columbus 


Bedford 


Vincennes 


Washington 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 
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Table  9.   List  of  1960  Cities  in  the  Sample  Space  -  Northern  Zone 


Urban 

Planning 

and  Devel- 

City Popula 

tlon 

opment 

Region* 

2,500-10,000 

10,001-20, 

000 

20,001-50,000 

No. 

IB 

Knox 

1 

IB 

Rensselaer 

2 

2 

Bremen 

3 

2 

Plymouth 

4 

2 

LaPorte 

5 

2 

Michigan  City 

6 

2 

Warsaw 

7 

2 

Elkhart 

8 

2 

Goshen 

9 

2 

Napanee 

10 

3A 

Columbia  City 

11 

3A 

Angola 

12 

3A 

Kendallville 

13 

3A 

Ligonier 

14 

3A 

Huntington 

15 

3B 

Berne 

16 

3B 

Decatur 

17 

3B 

Auburn 

18 

3B 

Garrett 

19 

3B 

Bluffton 

20 

4 

Monticello 

21 

4 

Lafayette 

22 

4 

W.  Lafayette 

23 

4 

Crawfordsville 

24 

4 

Delphi 

25 

4 

Attica 

26 

4 

Covington 

27 

4 

Frankfort 

28 

5 

N.  Manchester 

29 

5 

Wabash 

30 

5 

Tipton 

31 

5 

Peru 

32 

5 

Kokomo 

33 

5 

Rochester 

34 

5 

Logansport 

35 
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Table  10.   List  of  1960  Cities  in  the  Sample  Space  -  Central  Zone 


Urban 
Planning 
and  Devel- 
opment 
Region* 


6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
7 
7 
7 
7 
7 
8 


City  Population 


2,500-10,000    10,001-20,000    20,001-50,000 


Union  City 
Winchester 
Alexandria 

Chesterfield 

Dunkirk 
Portland 

Fairmount 
Gas  City 

Hartford  City 

Clinton 

Sullivan 

Greencastle 

Rockville 

Brazil 

Martinsville 

Mooresville 

Edinburg 

Franklin 

Greenwood 

New  Whiteland 

Noblesville 

Greenfield 

Brownsburg 

Danville 

Plainf ield 

Lebanon 

Cambridge  City 

Rushville 
Brookville 


Anderson 


Elwood 


New  Castle 


Marion 


Shelbyville 


Richmond 


Connersville 


No. 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
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Table  11.   List  of  1960  Cities  in  the  Sample  Space  -  Southern  Zone 


Urban 

Planning 

and  Devel- 

opment 

City  Population 

Region* 

2,500-10,000 

10,001-20,000 

20: 

,001-50,000 

No. 

10 

Bloomington 

1 

10 

Spencer 

2 

11 

North  Vernon 

3 

11 

Seymour 

4 

11 

Greensburg 

5 

11 

Columbus 

6 

12 

Batesville 

7 

12 

Madison 

8 

12 

Aurora 

9 

12 

Lawrenceburg 

10 

13A 

Loogootee 

11 

13A 

Bedford 

12 

13A 

Mitchell 

13 

13A 

Bicknell 

14 

13A 

Vincennes 

15 

13A 

Linton 

16 

13A 

Washington 

17 

13B 

Booneville 

18 

13B 

Oakland  City 

19 

13B 

Princetown 

20 

13B 

Petersburg 

21 

13B 

Mt.  Vernon 

22 

14 

Salem 

23 

14 

Austin 

24 

14 

Scottsburg 

25 

14 

Corydon 

26 

15 

Rockport 

27 

15 

Tell  City 

28 

15 

Paoli 

29 

15 

Huntingburg 

30 

15 

Jasper 

31 
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Table  12,   List  of  1970  Cities  in  the  Sample  Space  -  Northern  Zone 


Urban 
Planning 
and  Devel- 
opment 
Region* 


City  Population 


2,500-10,000    10,001-20,000    20,001-50,000 


No. 


IB 

IB 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

3A 

3A 

3A 

3A 

3A 

3B 

3B 

3B 

3B 

3B 

4 

4 

4 

4 

4 

4 

4 

5 

5 

5 

5 

5 

5 

5 


Knox 

Rensselaer 
Bremen 
Plymouth 

Long  Beach 

Trial  Creek 
Westville 
Warsaw 
Winona  Lake 


Napanee 
Columbia  City 
Angola 

Kendallville 
Ligonier 

Berne 

Decatur 

Auburn 

Garrett 

Bluffton 

Monticello 

Delphi 
Fowler 
Attica 
Covington 

N.  Manchester 

Tipton 


Rochester 


LaPorte 
Michigan  City 


Elkhart 


Goshen 


Huntington 


Crawfordsville 


Frankfort 

Wabash 

Peru 


Kokomo 


1 

2 

3 

A 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 
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Table  13.   List  of  1970  Cities  in  the  Sample  Space  -  Central  Zone 


City  Population 


Urban 
Planning 
and  Devel- 
opment   

Region*  2,500-10,000    10,001-20,000    20,001-50,000  No. 

6  Union  City  1 

6  Winchester  2 

6  Alexandria  3 

6  Dunkirk  4 

6  Portland  5 

6  New  Castle  6 

6  Fairmount  7 

6  Gas  City  8 

6  Marion  9 

6  Upland  10 

6  Hartford  City  11 

7  Greencastle  12 
7  Rockville  13 
9  Cambridge  City  14 
9  Richmond  15 
9  Rushville  16 
9  Brookville  .  17 
9                       Connersville  18 
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Table  lA.   List  of  1970  Cities  in  the  Sample  Space  -  Southern  Zone 


Urban 

Planning 

and  Devel- 

City Population 

opment 

Region'"' 

2,500-10,000 

10,001-20,000 

20,001-50,000 

No. 

10 

Bloomington 

1 

11 

Mt.  Vernon 

2 

11 

Seymour 

3 

11 

Greensburg 

A 

11 

Columbus 

5 

12 

Batesville 

6 

12 

Hanover 

7 

12 

Madison 

8 

12 

Aurora 

9 

12 

Lawrenceburg 

10 

13A 

Loogootee 

11 

13A 

Bedford 

12 

13A 

Mitchell 

13 

13A 

Bicknell 

14 

13A 

Vincennes 

15 

13A 

Bloomf ield 

16 

13A 

Linton 

17 

13A 

Washington 

18 

13B 

Booneville 

19 

13B 

Fort  Branch 

20 

13B 

Oakland  City 

21 

13B 

Princetown 

22 

13B 

Petersburg 

23 

13B 

Mt.  Vernon 

24 

14 

Salem 

25 

14 

Scottsburg 

26 

14 

Corydon 

27 

15 

Rockport 

28 

15 

Tell  City 

29 

15 

Paoli 

30 

15 

Huntingburg 

31 

15 

Jasper 

32 
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The  cells  were  sampled  according  to  a  maximum  criteria  of  20 
percent  and  a  minimum  criteria  of  two  cities.   The  actual  number  of 
cities  to  be  sampled  from  each  cell  was  included  in  each  of  the  36 
cells  of  Tabl.e  15.   The  only  exception  to  the  above  minimum  criteria 
occurred  in  three  out  of  the  thirty-six  cells  which  contained  only  one 
city. 

Once  the  number  of  cities  to  be  sampled  in  each  of  the  36  cells 
were  known,  a  random  number  table  was  used  to  obtain  names  of  the 
actual  cities  to  be  sampled  from  each  cell  among  the  cities  listed  in 
Tables  3  to  12.   The  sample  cities  so  chosen  are  listed  in  Tables  16 
to  19.   Above  tables  are  ordered  to  represent  all  the  sample  cities 
for  the  time  periods  of  1940,  1950,  1960  or  1970  for  the  areas  shown 
in  Figures  H  to  14.   The  above  scheme  resulted  in  98  cities  to  be 
studied,  which  means  27  percent  of  cities  in  the  sample  space  were 
sampled. 

Data  Collection 
It  is  necessary  to  measure  the  values  of  the  11  model  variables 
listed  in  Table  2  for  the  98  sample  cities  listed  in  Tables  16  to  19 
to  calibrate  the  path  model  of  equations  6a  to  6i.   But  prior  to 
actual  data  collection,  the  11  model  variables  are  to  be  further 
defined.   The  names  of  the  variables  as  given  in  Table  2  generally 
convey  the  basic  meaning  of  each  of  the  variables,  but  for  data 
collection  purposes  specific  operational  definitions  of  these 
variables  are  needed.   The  operational  definitions  and  measurement 
units  differ  from  one  study  to  another  based  on  the  dimensions  of 
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Table  16,   Cross-Sectional  Samples  of  Non-Metropolitan  Indiana  Cities 
for  1940 


Location 


2,500-10,000 


City  Population 


10,001-20,000 


20,001-50,000 


North 

Valparaiso 

Huntington 

Lafayette 

North 

Warsaw 

Frankfort 

Kokomo 

North 

Kendallville 

North 

Attica 

Central 

Dunkirk 

Shelbyville 

Muncie 

Central 

Hartford  City 

Connersville 

Richmond 

Central 

Greencastle 

South 

Greensburg 

Bedford 

Bloomington 

South 

Batesville 

Vincennes 

South 

Linton 

South 

Jasper 

Table  17.   Cross-Sectional  Samples  of  Non-Metropolitan  Indiana  Cities 
for  1950 


City  Population 

Location 

2,500-10,000 

10,001-20,000 

20,001-50,000 

North 

Napanee 

LaPorte 

Kokomo 

North 

Monticello 

Frankfort 

Logansport 

North 

Tipton 

North 

Rochester 

Hartford  City 

Central 

Alexandria 

New  Castle 

Anderson 

Central 

Portland 

Shelbyville 

Richmond 

Central 

Greencastle 

Central 

Franklin 

South 

Aurora 

Bedford 

Bloomington 

South 

Bicknell 

Vincennes 

South 

Petersburg 

South 

Tell  City 

South 

Jasper 

68 


Table  18.   Cross-Sectional  Samples  of  Non-Metropolitan  Indiana  Cities 
for  1960 


City  Population 

Location 

2,500-10,000 

10,001-20,000 

20,001-50,000 

North 

Plymouth 

Goshen 

Michigan  City 

North 

Berne 

W.  Lafayette 

Logansport 

North 

Covington 

North 

Tipton 

Central 

Chesterfield 

Elwood 

Anderson 

Central 

Rockville 

Shelbyville 

Richmond 

Central 

Martinsville 

Central 

Edinburg 

Central 

Noblesville 

Central 

Lebanon 

South 

Lawrenceburg 

Madison 

Blooraington 

South 

Bicknell 

Washington 

Columbus 

South 

Boonevllle 

South 

Petersburg 
Scottsburg 

Table  19.   Cross-Sectional  Sample  of  Non-Metropolitan  Indiana  Cities 
for  1970 


City  Population 

Location 

2,500-10,000 

10,001-20,000 

20,001-50,000 

North 

Knox 

Goshen 

Michigan  City 

North 

Long  Beach 

Crawfordsville 

Elkhart 

North 

Delphi 

North 

Fowler 

North 

Covington 

Central 

Portland 

Connersville 

New  Castle 

Central 

Upland 

Richmond 

Central 

Hartford  City 

South 

Bicknell 

Madison 

Bloomington 

South 

Bloomf ield 

Washington 

Columbus 

South 

Booneville 

South 

Fort  Branch 

South 

Jasper 

69 


inference  space  and  level  of  aggregation.   In  the  following  sections 
the  11  variables  are  operationally  defined  and  based  on  the  given 
definition  their  respective  value  is  measured. 

City  Location  (XI) 
Indiana  cities  were  stratified  to  northern,  central  and  southern 
zones  prior  to  randomized  sampling  to  test  for  significance  of  the 
geographic  location.   If  the  three  zones  proved  significant  then 
further  disaggregation  of  each  of  the  three  zones  could  test  for 
significance  of  the  city  location  variable   on  the  level  of  each  of 
the  18  Indiana  development  and  planning  regions  shown  in  Figure  10. 
One  way  to  represent  the  location  variable  in  the  model  was  by  the 
use  of  the  dummy  variables  (North=00,  central=01,  and  south=10) . 
Dummy  variables  in  the  path  model  are  normally  presented  as  continuous 
variables  (8) ;  and  this  was  accomplished  by  measuring  vertical  dis- 
placement of  sample  cities  in  miles  relative  to  the  location  of 
Indianapolis  at  the  center  of  the  state.   These  distances  which  were 
scaled  as  positive  north  of  Indianapolis  and  negative  south  of 
Indianapolis  are  shown  in  Table  20.   Measured  values  of  locations  thus 
obtained  were  continuous  dummy  variables  with  a  horizontal  line  thru 
the  center  of  Indianapolis  as  the  reference  line;  similar  results 
would  have  been  obtained  if  the  northern  or  southern  state  boundary 
would  have  been  used  as  the  reference  line. 

Inter-City  Accessibility  (X2) 
It  is  believed  that  city  development  is  significantly  affected 
by  its  locational  advantage  in  relation  to  the  regional  markets.   A 
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modified  form  of  "Interaction  Potential"  is  used  to  obtain  the 

indices  of  location  advantage  of  sample  cities.   The  "Interaction 

Potential"  is  defined  by  Shafran  and  Wlgman  (9)  as: 

m   P.P. 

I. P.  (i,m)  =   E   -i- 1  (9) 

J=l     Ij 

where 

I. P.  (i,m)  =  interaction  potential  between  growth  point  1  and  ra 
regional  markets, 

P.         =  population  of  growth  point  1, 

P.         =  population  of  growth  point  j. 

t..        =  minimum  path  travel  time  between  growth  point  1  and 
regional  market  j . 
For  use  in  the  calibration  of  the  path  model  a  modified  "Interaction 
potential"  is  obtained  by  normalizing  the  interaction  potential  by 
the  population  of  city  under  consideration.   The  modified  equation  10 
still  is  a  good  measure  of  potential  for  interaction  of  city  1  in 
respect  to  all  other  growth  points  in  the  study  areas  but  does  not 
multiply  the  resultant  index  with  the  population  of  the  city  for  which 
the  index  is  being  calculated.   The  basic  reason  for  this  omission  is 
that  city  population  is  being  used  as  a  separate  variable  In  the  model 
and  direct  use  of  equation  constitutes  a  double  count  of  population. 

m   P. 

I.P.d.m)   ....  ,  =   Z   -^  (10) 

modified    .  ,   t .  . 

The  calculation  of  modified  interaction  potential  for  sample 
cities  required  population  data  for  all  the  growth  centers  In  the 
region  as  well  as  the  minimum  travel  time  between  the  sample  cities 
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and  all  other  growth  centers  in  the  region.   The  region  used  for 
purpose  of  calculation  of  interaction  potential  is  larger  than  the 
primary  study  area  which  is  limited  to  the  state  of  Indiana  and  the 
population  of  the  growth  centers  often  include  more  than  population 
within  one  city  limit. 

Limits  of  the  Study  Region  for  Calculation  of  Interaction  Potential 
The  study  region  for  calculation  of  interaction  potential  as  a 
proxy  for  inter-city  accessibility  is  larger  than  the  State  of  Indiana 
(the  primary  study  area)  because  Indiana  cities  have  continuous 
economic  and  cultural  exchange  with  growth  centers  outside  the  State 
of  Indiana. 

As  the  distance  between  an  Indiana  city  and  a  foreign  city  in- 
creases, the  transportation  system  outside  Indiana  plays  a  more 
important  role  in  the  potential  for  interaction  and  the  differences 
between  the  locational  advantage  of  an  Indiana  city  with  respect  to  a 
highway  system  inside  Indiana  becomes  less  important  in  the  calcula- 
tion of  interaction  potential.   Therefore,  the  study  region  for 
calculation  of  modified  interaction  potential  was  limited  to  the 
State  of  Indiana  and  parts  of  the  neighboring  states. 

Two  different  criteria  were  used  to  choose  the  portions  of  the 
study  area  outside  the  State  of  Indiana:   first,  cities  in  the 
neighboring  state  which  were  located  in  the  two  counties  adjoining 
Indiana  were  included  in  the  study  area,  second,  other  cities  with 
population  of  100,000  or  more  and  within  100  miles  of  Indiana  borders 
were  included. 
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The  last  criterion  was  time  dependent  and  results  in  slightly 
different  study  areas  for  the  study  periods  of  19A0,  1950,  1960  and 
1970.   As  an  example,  the  study  area  for  1940  is  shown  in  Figure  15. 
which  is  a  reproduction  of  the  coded  19A0  highway  network.   The 
variable  portion  of  the  study  areas  for  other  study  periods  are  not 
graphically  presented  but  the  cities  included  in  each  of  the  four 
study  periods  are  shown  in  Tables  A2  to  A5  of  Appendix  A. 

Minimum  Travel  Time 

Travel  times  (T..)  between  each  of  the  sample  cities  and  all 
other  cities  in  the  study  area  was  required  for  calculation  of  inter- 
action potentials.   The  minimum  travel  time  between  two  cities  depends 
on  highway  distance  between  the  two  and  highway  travel  speeds  (running 
speeds) • 

Once  highway  speeds  and  distances  were  known  then  the  highway 
system  was  coded  and  a  "minimum  tree  algorithm"  applied  to  calculate 
the  minimum  travel  time  between  different  locations  within  the  network. 
In  this  project,  the  historical  highway  travel  speeds  were  not  readily 
available  and,  as  explained  in  the  following  sections,  were  estimated 
by  assuming  that  the  ratio  of  running  speed  and  spot  speed  for  a 
particular  highway  type  was  a  constant  from  1940  to  1960. 

Highway  Speeds 

Highway  speeds  depend  on  a  complex  set  of  variables  but  generally 
were  increasing  from  1940  to  1970  and  at  a  given  time  were  higher  on 
highways  with  higher  design  standards. 
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FIGURE  15.       CODED     REGIONAL    HIGHWAY     NETWORK    USED 

FOR    CALCULATION    OF    INTERACTION    POTENTIAL"  1940 
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Information  sources  available  for  determination  of  highway  speeds 
for  the  four  study  periods  were  traffic  speed  reports  prepared  by  the 
Joint  Highway  Research  Project  (10,11,12),  a  research  report  by  Mr. 
Frost  (13)  which  gives  information  about  1940  highway  speeds  and 
Vodrazka's  Ph.D.  thesis  (14)  which  serves  as  a  guide  for  choice  of 
appropriate  highway  travel  speed  as  a  function  of  published  spot 
speeds. 

Vodrazka  studied  statewide  intercity  travel  in  Indiana  for  the 
purpose  of  determining  the  future  need  for  upgrading  of  certain  Indiana 
highways  to  freeway  standard.   In  his  1966  study  Vodrazka  assigns 
travel  speeds  to  different  classes  of  Indiana  highways  as  shown  in 
Table  21. 

Eighty-five  percentile  speeds  are  available  for  1960  and  1970 
(10,11).   Speed  changes  between  1960  and  1970  as  well  as  linear 
regression  lines  representing  spot  speed  trends  for  different  types  of 
highways  were  previously  calculated  (10),  and  are  shown  in  Figure  16. 

Lines  with  the  slopes  of  1960-1970  spot  speed  trend  line  were 
passed  thru  corresponding  travel  speeds  used  by  Vodrazka  to  yield  an 
estimate  of  a  reasonable  travel  speed  for  1960  and  1970.   Rural  travel 
speeds  for  two-lane,  four-lane  and  Interstate  highways  were  thus  ob- 
tained from  Figure  16  and  are  shown  in  Table  22. 

Travel  speeds  for  1950  were  estimated  by  adjustment  of  June  1950 
spot  speeds.  The  ratio  of  mean  spot  speed  and  estimated  travel  speed 
for  1960  is  shown  in  column  4  of  Table  23.  It  was  assumed  that  these 
ratios  were  approximately  valid  for  1950  and  were  multiplied  with  the 
1950  mean  spot  speeds  to  result  in  1950  estimated  rural  travel  speeds 
for  two  lane  and  four  lane  highways  which  are  shown  in  column  six  of 
Table  23. 
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Table  21.   Assumed  Highway  Travel  Speeds  -  1966 

Travel  Speeds 

Highway  Type  (mph) 

Interstate  60 

4-Lane  Rural  50 

2-Lane  Rural  45 

4-Lane  Urban  30 

2-Lane  Urban  15 


Source:   Vodrazka,  Walter  Charles,  "Subclassif ication  of  the  State 
Highway  System  of  Indiana  Based  on  Synthesis  of  Intercity 
Travel",  Ph.D.  Thesis,  Purdue  University,  W.  Lafayette, 
Indiana,  June  1968. 
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Date  (Year) 


Figure   Ifi.       Riirnl    Trnvol    Speed    1960-1.970 
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Table  22.   Estimated  Rural  Highway  Travel  Speeds  -  1960  and  1970 


1960 

1970 

Measured  85% 

Estimated 

Measured  85% 

Estimated 

Spot  Speed 

Running  Speed 

Spot  Speed 

Travel  Speed 

Highway  Type 

(mph) 

(mph) 

(mph) 

(mph) 

Column  1 

Column  2* 

Column  3* 

Column  4* 

Column  5* 

Interstate 

62 

52 

75 

65 

4-Lane 

64 

46 

72 

52 

2-Lane 

60 

40 

70 

48 

*Source  Figure  16. 


Table  23.   Estimation  of  Average  Travel  Speeds  for  Rural  Highways  in 
Indiana  -  1950 


Column  1 

Column  2 

Column  3 

Column  4 

Column  5 

Column  6 

Mean  Spot 

Assumed 

Ratio  of 

Mean 

Speed  for 

Travel 

Travel  Speed 

Spot 

Estimated 

All  Autos 

Speed 

to  Spot  Speed 

Speed 

Travel  Speed 

Highway 

in  1960(1) 

for  1960(1) 

for  1960: 

1950  (2) 

Speed  1950: 

Type 

(mph) 

(mph) 

(C0I.3/C0I.2) 

(mph) 

(Col.4xCol.5) 

2-Lane 

56 

40 

0.72 

52 

38 

4-Lane 

46 

0.82 

56 

46 

Notes:   (1)   Values  obtained  from  Figure  16. 

(2)   Source:   Traffic  Speed  Report  44  (Reference  12) 


79 


Frost  in  a  study  of  "Indiana  war  time  travel  speeds"  gives  the 
overall  50  percentile  spot  speed  measured  in  February  1942  as  49.56 
miles  (13).   Assuming  a  linear  speed  trend  between  1940  and  1960  the 
overall  50  percentile  spot  speed  was  adjusted  to  46  mph  for  1940.   A 
further  assumption  that  the  ratio  of  50  percentile  spot  speed  to 
running  speed  remains  the  same  as  that  of  1960  (shown  in  Table  22) 
allows  the  estimation  of  a  33  mph  running  speed  (46  x  0.72)  for  1940 
two-lane  rural  highways. 

Historical  highway  speeds  in  urban  areas  were  not  available  but 
it  was  necessary  to  assume  urban  highway  speed  to  enable  "loading"  the 
network.   Assumed  urban  highways  travel  speeds  for  the  four  study 
periods  are  shown  in  Table  24.   It  is  believed  that  these  assumptions 
are  reasonable.   Minimum  travel  times  are  not  very  sensitive  to  urban 
travel  speeds  because  the  magnitude  of  intercity  travel  over  urban 
highways  are  usually  less  than  one  mile. 

Coding  the  Highway  Network 

Highway  maps  of  Indiana  (15,16,17,18),  Illinois  (19,20,21,22), 
Kentucky  (23,24,25,26),  Michigan(27 ,28, 29,30)  and  Ohio  (31,32,33,34) 
were  used  to  code  the  regional  highway  network  for  the  four  study 
periods  of  1940,  1950,  1960  and  1970.   Highways  on  these  maps  were 
classified  as  to  number  of  lanes  and  type  of  access.   Therefore,  high- 
way travel  speeds  obtained  in  the  last  section  were  used  to  assign 
travel  speeds  to  the  highway  links.   Highway  distances  were  usually 
read  from  the  map  or  actual  highway  mileage  were  scaled.   The  scale 
of  the  maps  used  ranged  from  1  in.  =  7. 9  miles  to  1  in.  =  16  miles  which 
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Table  24.   Assumed  Urban  Highway  Speed  for  Coding  Inter-City  Highway 
Network* 


Highway  Type 

1940 

Speed 

(mph) 

1950 

Speed 

(mph) 

1960 

Speed 

(mph) 

15 

1970 

Speed 

(mph) 

Two-Lane 

15 

15 

20 

4-Lane 

N.A. 

N.A. 

27 

32 

Urban  Freeway 

N.A. 

N.A. 

40 

40 

*Urban  highways  are  used  only  for  loading  the  inter-city  network. 


N.A.  -  not  applicable.  The  highway  maps  available  do  not  distinguish 
between  two-lane  highways  and  multi-lane  highways  in  the  urban 
areas . 
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allowed  scaling  of  highway  distances  to  a  tenth  of  a  mile.   A  few 
highway  links  in  the  1940  network  were  not  paved  with  asphalt  or 
concrete  and  their  travel  speed  was  penalized  from  33  miles  per  hour 
to  28  miles  per  hour. 

Growth  centers  with  greater  than  2,500  population  were  coded  as 
a  centroid.   The  population  of  the  growth  center  included  that  of  the 
central  city  as  well  as  that  of  the  fringe  areas.   Each  of  the 
urbanized  areas  and  standard  statistical  areas  in  the  region  were 
considered  as  one  growth  center.   For  other  places,  the  population  of 
villages  and  towns  and  cities  as  close  as  three  miles  to  each  other 
were  aggregated  to  a  single  centroid.   The  number  of  centroids  as  well 
as  the  number  of  nodes  and  links  for  the  network  of  each  of  the  four 
study  periods  is  shown  in  Table  25. 

Centroids  were  connected  to  the  network  nodes  by  dummy  links. 
The  dummy  links  were  used  to  load  the  network  and  were  assigned  a 
distance  to  account  for  the  distance  from  the  homes  to  main  city 
arterials .   For  urbanized  areas  these  distances  were  assumed  to  be 
one-fourth  the  city's  average  dimensions.   For  smaller  centroids,  the 
following  impedance  distances  which  were  obtained  from  Vodrazka  (35) 
were  assigned  to  the  dummy  links.   Also  some  highway  links  contained 
one  or  more  towns  with  less  than  2,500  populations.   Passing  thru 
these  small  population  centers  cause  travel  delay  which  was  accounted 
for  by  increasing  the  actual  highway  mileage  by  the  same  impedance 
distances. 
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Table  25.   Size  of  Coded  Regional  Highway  Network  -  1940  to  1970 


Network 
Element 


Centroid 

Nodes 

Links 


Number*  of  each  element  of  the  network  in  the  year 
1940  1950  1960  1970 


175 

744 

2010 


■  194 
903 

2576 


218 
856 

2386 


2  34 
1354 
4212 


*The  numbers  given  are  approximate.   It  was  necessary  to  keep  the  same 
centroid  numbers  for  each  decade.   Therefore  originally  centroids  for 
1970  were  numbered  from  1  to  239.   Wlien  a  decrease  in  population 
caused  a  1970  centroid  not  to  be  considered  a  centroid  in  1960  net- 
work its  number  was  kept  and  it  was  considered  as  a  dummy  centroid. 
There  are  4  dummy  centroids  in  1970,  11  dummy  centroids  in  1960, 
70  dummy  centroids  in  1950,  and  84  dummy  centroids  in  1940  highway 
network.   Each  dummy  centroid  has  two  associated  dummy  links. 
Additionally  there  are  some  other  extra  nodes. 
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City  Size  Impedance  Distance 

Over  5,000  1.0  mile 

1,000-5,000  0.5  mile 

100  -1,000  0.2  mile 

It  was  assumed  that  all  links  were  two  way  links  and  distance  as 
well  as  speed  on  both  directions  was  the  same.  As  a  result  of  this 
assumption  only  one  way  links  were  coded  and  a  small  computer  program 
was  written  to  read  the  four  pieces  of  information  for  each  link 
(Node  1,  distance,  speed,  Node  2),  calculate  the  travel  time  and  re- 
write the  complete  information  for  both  directions  of  travel.   Next, 
a  sort  routine  was  used  to  rearrange  the  links  in  the  increasing  order 
of  Node  1  and  then  on  the  increasing  order  of  Node  2.   The  computer 
program  TNET-5  (36)  was  used  to  check  the  network  to  assure  that  all 
the  links  were  two  way,  ordered  correctly,  and  there  were  no  missing 
nodes  or  centroida.   Once  a  correct  network  was  available  then  it  was 
used  to  calculate  the  minimum  tree  paths  between  the  centroids. 

The  plans  of  coded  1940,  1950,  1960  and  1970  highway  network  as 
well  as  the  listings  of  corresponding  coded  highway  network  are 
available  in  the  Purdue  University  Joint  Highway  Research  Program.   As 
an  example,  and  to  show  the  limits  of  the  study  area,  the  1940  coded 
highway  network  is  shown  in  Figure  15. 

Algorithm  Used  to  Calculate  Minimum  Travel  Time 
Between  Centroids 

The  coded  highway  networks  were  used  to  calculate  the  minimum 
travel  paths  between  the  centroids.   The  algorithm  used  was  that  of 
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the  British  Road  Laboratory,  as  used  in  the  Tnet-10  computer  program 
developed  by  the  University  of  California  at  Berkeley  (37).   The 
dimensions  of  this  program  were  modified  to  enable  the  solution  of 
minimum  path  trees  for  relatively  large  networks  analyzed  in  this 
project . 

The  output  of  this  program  for  each  study  period  was  the  minimum 
path  trees  from  each  centroid  to  all  other  centroids  and  a  minimum 
time  matrix  between  the  centroids.   These  outputs  are  not  presented 
here  because  of  the  space  limitation,  but  are  stored  on  magnetic  tape 
for  future  reference.   The  minimum  time  matrix  for  each  period  then 
was  used  in  calculation  of  the  interaction  potential  for  the  respective 
time  period. 

Calculation  of  Interaction  Potentials 

Once  the  minimum  travel  times  were  calculated,  interaction 
potential  in  accordance  with  equation  10  was  calculated.   The  other 
data  requirement  for  this  equation  was  the  population  of  the  centroid 
which  was  obtained  from  previously  referenced  census  publications  (3,4, 
5,6). 

Program  "potential",  shown  in  Table  Al  of  Appendix  A,  was  written 
to  compute  interaction  potentials  for  all  the  centroids  in  the  study 
area.   Interaction  potentials  for  the  sample  cities  are  shown  in 
column  4  of  Tables  26  to  29.   Column  3  of  these  tables  show  the  popula- 
tion of  the  centroid  used  in  calculation  of  the  interaction  potentials. 
These  populations  were  larger  than  the  population  of  the  city  proper 
used  in  the  path  model  and  given  in  Tables  37  to  40. 


Table  26.   Interaction  Potential  for  1940  Sample  Cities 
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Population 

Centroid 

of 

Interaction 

No. 

Name  of  Centroid 

Centroid 

Potential 

11 

WLPPiRftlSO 

873G 

G.2G3703E+04 

13 

KEMDALLUILLE 

5431 

4.454491E+04 

21 

WARSAW,  liimm   LAKE 

7121 

4.582030E+04 

2? 

HUNTIISGTON 

13903 

4.478GB5E+04 

37 

KDK0I10 

33795 

4.310595E+04 

38 

HARTFORD  CITY 

B34B 

4.253133E+04 

40 

LAFAYETTE.  WEST  LAFAYETTE 

350G8 

4.402127E+04 

42 

ATTICA, WILLI AMSPORT 

4982 

4.145354E+04 

43 

FRANKFORT 

1370S 

4.255B29E+04 

47 

MUMCIE  METROPOLITAN  AREA 

49720 

4.3G0118E+04 

58 

RICHMOND,  CENTERUILLE. BOSTON 

3G491 

4.22G820E+04 

G7 

GREENCASTLE 

4872  . 

3.812224E+04 

71 

CONNERSUILLE 

1289B 

3.917343E+04 

72 

SHELBYUILLE  CITY 

10791 

4.15G110E+04 

79 

GREENSBURG 

BOGS 

3.891172E+04 

81 

BATESUILLE 

30G5 

3.754302E+04 

83 

BLOOMINGTON 

20870 

3.555374E+04 

87 

LINTON 

B2G3 

3.1G0982E+04 

90 

BEDFORD  .OOLITIC 

13700 

3.301439E+04 

95 

UINCENNES 

18228 

3.058113E+04 

102 

JASPER 

5041 

2.788404E+04 

Table  27.   Interaction  Potential  for  1950  Sample  Cities 
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Population 

Centroid 

of 

Interaction 

No. 

Name  of  Centroid 

Centroid 

Potential 

24 

ROCHESTER 

4E73 

5.00G352E+04 

30 

MONT I CELLO 

34G7 

4.882128E+04 

31 

LOGftrSSPORT  CITY 

21031 

4.9?i792E+04 

37 

KOKOnO 

38672 

4.393729E+04 

38 

HARTFORD 

7253 

5.0G1418E+04 

39 

PORTLAND 

70G4 

4.931728E+04 

43 

FRANKFORT 

15028 

4.954254E+04 

44 

TIPTON 

5G33 

5.018552E+04 

4G 

ALEXANDRIA,  ORESTES 

5G29 

5.2371G9E+04 

51 

ANDERSON,  CHESTERFIELD, 

EDGEW  48702 

5.274507E+04 

57 

NEW  CASTLE 

18271 

4.884838E+04 

58 

RICHMOND,  CENTERUILLE, BOSTON   41182  ■ 

5.303423E+04 

G? 

GREENCASTLE 

G8S3 

4.4177G5E+04 

72 

SHELBYUILLE  CITY 

11734 

5.1179G7E+04 

73 

FRANKLIN 

731G 

5.003118E+04 

83 

BLOOMINGTON 

281G3 

4.071288E+04 

8G 

AURORA 

4780 

4.824035E+04 

90 

BEDFORD  , OOLITIC 

13G87 

3.825397E+04 

92 

BICKNELL 

4572 

3.50G8D8E+04 

95 

UINCENNES 

18B31 

3.577351E+04 

101 

PETERSBURG  CITY 

3035 

3.382525E+04 

102 

JASPER 

5215 

3.247754E+04 

115 

TELL  CITY 

5735 

2.9G4402E+04 

151 

NAPANEE 

3393 

5.128927E+04 

Table  28.   Interaction  Potential  for  1960  Sample  Cities 
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Centroid 

No. 

Name  of  Centrold 

p. 

opulation 

of 
Centrold 

Interaction 

Potential 

1 

MICHIGAN  CITY, LONG  BEACH+TUO 

41151 

9.403323E+04 

9 

GOSHEN 

13718 

8.419G0GE+04 

IS 

PLYMOUTH 

7558 

8.832902E+04 

31 

LOGftNSPORT  CITY 

2110G 

7.71020GE+04 

34 

BERNE  CITY 

2B44 

G.932143E+G4 

40 

LAFFAYETTE,  WEST  LfiFFAYETTE 

55010 

7.053453E+04 

44 

TIPTON 

5504 

7.767BE2E+04 

45 

ELUOOD 

11793 

7.527371E+04 

50 

COUINGTON 

2759 

G.589713E+04 

51 

ANDERSON,  CHESTERFIELD. 

EDGEU 

537EB 

7.112907E+04 

51 

ANDERSON,  CHESTERFIELD. 

EDGEW 

537G8 

7.112907E+04 

53 

LEBANON, ULEN 

9904 

7.25324eE+04 

54 

NOBLESUILLE 

7GG4 

8.131942E+04 

58 

RICHMOND,  CENTERUILLE, BOSTON 

4B7G7 

7.BG4792E+04 

72 

SHELBYUILLE  CITY 

14317 

7.43239GE+04 

7G 

MARTINSUILLE 

7525 

E.957532E+04 

70 

EDINBURG 

3GG4 

B.593531E+04 

82 

COLUMBUS 

20778 

B.333144E+04 

83 

BLOOMINGTON 

31357 

6.321825E+04 

91 

MADISON, HANOUER 

11G58 

5.797B22E+04 

92 

EICKNELL 

3878 

5.231502E+04 

SB 

WASHINGTON 

1084B 

5.2G1159E+04 

101 

PETESBURG  CITY 

2939 

4.999482E+04 

110 

BOONUILLE 

4801 

4.G25213E+04 

162 

LAWRENCEBURG, GREENDALE, 

HARDIN 

8083 

5.302948E+04 

1G4 

SCOTTSBURG 

3810 

G.331375E+04 

Table  29.   Interaction  Potential  for  1970  Sample  Cities 
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I 

'opulatlon 

Centroid 

of 

Interaction 

No. 

Name  of  Centroid 

Centroid 

Potential 

1 

MICHIGflN  CITY. LONG 

BEACH+TWO 

45813 

1.3232G1E+05 

1 

MICHIGAM  CITY, LONG 

BEACH+TWO 

45813 

1.3232G1E+05 

3 

ELKHART  CITY 

43152 

1.4G23G2E+05 

3 

GOSHEM 

17171 

1.45787GE+05 

19 

KMOX 

3519 

1.200493E+05 

35 

FOWLER 

2G43 

1.093973E+05 

38 

HARTFORD 

8207 

1.1592S8E+05 

39 

PORTLAND 

7115 

1.1825D0E+05 

50 

COUINGTON 

2G41 

9.97593GE+04 

52 

CFAUFORDSUILLE 

1384E 

1.053151E+05 

5R 

UPLAHD 

3202 

l,20503GE+05 

57 

HEW  CASTLE 

21215  ■ 

1.132G31E+05 

58 

RICHMOHD,  CEMTERUILLE. BOSTON 

4G539 

l,120502E+05 

71 

CONNERSUILLE 

17G04 

1.094747E+05 

82 

COLUMBUS 

27141 

1.013708E+05 

83 

BLOOM INGTON 

42890 

8.S47GG7E+04 

91 

MADISON, HANOUER 

1B099 

8.913003E+04 

92 

BICKNELL 

3717 

7.2G7331E+04 

9B 

WASHINGTON 

11358 

7.G4G498E+04 

102 

JASPER 

8G41 

7.2G2431E+04 

109 

FORTBRANCH 

2535 

7.G58G5GE+04 

110 

BOONUILLE 

573S 

7.499043E+04 

111 

BLOOMFIELD 

25B5 

8.0GG977E+04 

157 

DELPHI 

2582 

1.084019E+05 
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The  project  required  only  interaction  potentials  for  the  sample 
cities,  but  the  minimum  time  matrix  and  the  population  information 
for  all  the  centroids  in  the  region  were  required  for  calculation  of 
the  interaction  potentials  for  these  sample  cities;  therefore,  it  was 
decided  to  calculate  the  interaction  potential  for  all  the  centroids 
in  the  region.   Availability  of  interaction  potential  for  all  Indiana 
cities  would  give  a  better  feel  for  this  variable  and  in  case  of 
future  extension  of  the  study  to  other  cities  the  interaction 
potentials  would  be  readily  available.   The  interaction  potential  for 
all  centroids  from  1940  to  1970  are  presented  in  Tables  A2  to  A5  of 
Appendix  A. 

Intra-City  Accessibility  (X3) 

It  was  desired  to  obtain  an  index  of  overall  accessibility  for 

the  transportation  system  for  each  sample  city.   It  was  noted  that 

the  primary  mode  of  travel  in  the  small  Indiana  cities  being  studied 

was  by  highway  using  the  personal  auto.   The  accessibility  index  (51) 

n 
ace .  =  Z  =  a.  f . .  (12) 


^   j=l    J   ^J 


where 


ace  =  accessibility  index 

a.    =  number  of  attraction 
J 

f..   =  travel  time  factor  between  zones  i  and  1. 

used  in  urban  transportation  planning  was  a  satisfactory  measure  for 
this  purpose,  but  unfortunately  the  level  of  aggregation  of  the 
experiment  did  not  allow  Its  deployment  for  calculation  of  intra-city 
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accessibility.   The  scope  of  the  experiment  allows  only  the  collection 
of  data  on  the  level  of  the  city  as  a  whole  and  it  was  not  possible 
to  collect  zonal  attractions  or  minimum  time  matrix  between  the  city 
zones.   Therefore,  another  measure  of  overall  Intra-city  accessibility 
was  needed. 

Consideration  was  given  to  use  of  lane-miles  of  highways  within 
the  city  adjusted  according  to  their  functional  classification.   The 
use  of  this  measure  for  intra-city  accessibility  was  also  rejected 
because  of  the  difficulty  of  adjusting  the  actual  highway  lane  miles 
in  a  uniform  (and  repeatable)  manner  for  their  functional  classifica- 
tion.  Finally  acres  of  land  deployed  for  streets  and  highways  within 
each  city  were  used  as  proxy  for  intra-city  accessibility.   Attempts 
were  made  to  estimate  total  land  in  the  highway  right  of  way  for  this 
purpose.   It  is  often  possible  to  estimate  the  limit  of  right  of  way 
from  such  signs  as  sidewalks,  yard  lawns,  fences,  drainage  ditches, 
and  cultivation  of  abutting  lands  and  knowledge  of  approximate  widths 
of  right  of  way. 

This  operational  definition  of  intra-city  accessibility  places 
land  resource  used  for  highways  in  the  same  level  as  other  competing 
land  uses  as  defined  in  the  next  paragraph.   Therefore  the  numerical 
values  for  this  data  were  collected  with  the  data  for  other  land  uses 
as  discussed  in  the  next  section. 

Land  Use  Variables  (X4,  X8,  X9,  XIO,  Xll) 
Industrial  land  use  (X4) ,  commercial  land  use  (X8) ,  vacant  land 
use  (X9),  institutional  land  use  (XIO),  and  residential  land  use  (Xll) 


91 


were  operationally  defined  for  purpose  of  calibration  of  the  path 
model  in  acres  of  land  used  for  each  purpose  as  measured  from  aerial 
photographs. 

Prior  to  the  land  use  measurement  Table  30  was  prepared  to 
classify  land  uses  under  nine  categories  to  assist  in  the  uniform 
land  use  classifications  in  the  airphoto  interpretation  phase  of  this 
experiment.   Nine  land  use  categories  of  Table  30  were  chosen  as  to 
be  compatible  with  the  detail  available  on  the  aerial  photographs 
with  a  scale  ratio  of  1:20,000.   These  categories  were  also  compatible 
with  land  use  classification  usually  used  by  city  planners  and  were 
useful  in  the  calibration  of  the  path  model.   Actually  the  land 
categories  shown  in  Table  30  are  more  detailed  than  required  for 
calibration  of  the  path  model.   Lands  categorized  as  other  transpor- 
tation, agriculture,  and  waste  and  forest  were  not  used  in  the  cali- 
bration of  the  path  models,  the  reason  for  more  detailed  classifica- 
tion in  data  collection  than  the  one  used  in  the  path  model  was  to 
have  some  flexibility  in  case  of  later  needs  for  revision  of  the  path 
model.   Also  with  the  land  use  measurement  method  used  all  the  city's 
land  had  to  be  accounted  for. 

Method  of  Measurement  of  Land  Use  Data 

All  land  uses  were  obtained  from  vertical  aerial  photograhs  with 
a  scale  of  1:20,000  (1  inch  =  1666.66  feet)  published  by  the  United  States 
Department  of  Agriculture  (38,39).  This  scale  is  sufficient  for  study 
of  land  use  changes  in  urban  places  (40).   Most  of  the  required  1940 
photos  were  available  in  the  Joint  Highway  Research  Project  Laboratory, 


Table  30.   Classification  Used  in  Land  Use  Measurement 


92 


Land  Uses  Were  Recorded 
in  These  Categories 


Land  Use  Sub-Classification 


1.   Residential  Land  Use 


2.   Commercial  Land  Use 


3.   Industrial  Land  Use 


4.   Institutional  Land  Use 


5.   Streets  and  Highways 


6.   Other  Transportation 


7.   Vacant  and  Unused 


8.   Agriculture 


9.   Water  and  Forest 


Single  Family  Residential 
Multi-Family  Residential 
Mobile  Homes 

Retail  Establishment 
Office  Buildings 
Shopping  Centers 
Wholesale 

Manufacturing 
Mines  and  Quarries 

Institutional  Buildings  (schools, 

government  buildings) 
Institutional  Open  Space  (ballparks, 

parks,  country  clubs) 
Cemeteries 

Arterials  Highways 
Collector  Streets 
Local  Streets 

Railroad 
Airports 

Water  Transportation 
Utilities  (transmission  line-power 
plants-sanitation) 

Land  within  city  which  is  not  being 
used,  but  from  form  and  slope  it 
seems  easy  to  build  on 

Agriculture  (includes  land  used  for 
barns  and  other  farm  buildings) 

River,  Lake,  Filled  Quarries 
Woody  Areas 

Other  areas  with  rough  terrain  un- 
suitable for  building 
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but  many  of  the  photos  for  1950,  1960,  and  1970  were  borrowed  from 
Indiana  State  Highway  Commission,  and  U.S.  Geological  Survey  office 

3t  iiloomi  ngLoii ,  Indian.i. 

Land  uses  measured  from  the  aerial  photographs  were  to  be  asso- 
ciated with  tlieir  corresponding  socio-economic  data  obtained  from 
centinental  census  data.   Therefore  it  became  necessary  to  draw  on 
the  aerial  photos  the  corresponding  city  limits  used  by  the  census 
enumerators.   The  census  enumerators  used  the  regular  city  limits  for 
census  boundaries,  but  for  cities  under  50,000  population  these  city 
limits  were  recorded  in  unpublished  "place  maps"  which  for  the  first 
ten  years  are  made  available  to  the  public  thru  special  order  from  the 
Census  Bureau.   Thereafter  the  "place  maps"  may  be  purchased  from  the 
National  Archives  and  Record  Service.   Extensive  use  of  "place  maps" 
were  made  in  this  project  because  the  city  limits  were  changing,  and 
usually  it  was  not  possible  to  obtain  from  the  cities  a  map  for  the 
specific  dates  desired.   Most  cities,  evidently,  were  updating  the 
city  limit  on  the  same  original  and  did  not  try  to  keep  a  record  of 
city  limit  changes.   The  required  1970  place  maps  were  borrowed  from 
the  Geography  Department  Library  at  Indiana  University.   For  other 
dates  in  lieu  of  a  "place  map",  which  usually  was  purchased  for  the 
project,  when  possible  the  city  limits  shown  on  the  United  States 
Geological  Survey  Maps  (U.S.G.S.)  were  used. 

The  city  maps  obtained  from  different  sources  were  used  to  draw 
the  city  limits  on  the  corresponding  aerial  photographs.   The  city 
limits  used  were  usually  very  close  to  1940,  1950,  1960,  or  1970,  but 
till'  ncrl.il  |ih<)l  ())'r;i|)h,'i  were  ofLe'U  as  much  as  Lwo  lo  llifL'e  yc'ini 
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different  than  the  census  dates.   This  variation  in  the  dates  of 
aerial  photographs  were  assumed  acceptable,  because  land  uses  usually 
do  not  change  rapidly.  » 

The  vertical  aerial  photographs  used  were  most  accurate  in  the 
center  and  become  distorted  toward  the  edges.   But  fortunately,  be- 
cause of  the  50  percent  or  more  overlap  of  the  stereo  pairs,  mostly 
the  center  of  the  photographs  were  used  for  land  use  determination 
through  statistical  point  sampling  and  airphoto  interpretation. 

Statistical  Point  Sampling 

Statistical  point  sampling  was  used  in  lieu  of  delimiting 
individual  land  uses  on  the  photos  and  subsequent  area  measurement. 
The  statistical  point  sampling  has  been  used  by  the  Chicago  Area 
Transportation  Area  (CATS)  for  suburban  land  use  inventory  (41)  and 
others  (42).   In  the  statistical  point  sampling  a  previously  designed 
pattern  of  dots  on  a  transparency  is  overlaid  on  the  aerial  photos  of 
the  city  and  land  use  under  each  dot  is  recorded.   The  method  estimates 
proportion  of  population  of  points  in  a  given  area  which  fall  within 
each  land  use.   The  proportions  are  converted  to  acreage  of  land  use 
by  determining  total  acreage  of  the  area  being  inventoried.   The 
standard  error  of  the  sample  proportion  depends  on  the  density  of 
points,  and  was  determined  from  equation  11  used  by  CATS  (41). 


%-^7  <"> 


where 


a  =  standard  error  of  sample  proportion 

P   =  the  sample  proportion  of  point  in  tlie  land  use  for  which 
variance  is  being  measured 
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q  =  the  proportion  of  other  use 

N  =  the  total  number  of  points  falling  in  a  city. 

The  above  equation  assumes  randomness  of  points.   Actually  the 
points  are  not  chosen  randomly,  but  the  points  were  designed  to 
yield  a  "stratified  systematic  unaligned  sample",  and  according  to 
CATS  this  sampling  scheme  causes  the  above  equation  to  be  conservative. 
The  details  of  construction  of  a  stratified  systematic  unaligned 
sample  overlay  was  explained  in  the  Agricultural  Handbook  No.  237  (A3). 
This  is  an  efficient  sampling  procedure  because  the  resultant 
stratified  systematic  sampling  will  provide  unbiased  estimates  due 
to  lack  of  periodicities. 

The  dot  pattern  overlay  designed  for  the  current  project  is  re- 
produced in  Figure  17  which  shows  four  points  were  sampled  for  every 
one  square  inch  of  aerial  photographs.   Given  tlie  scale  of  aerial 
photos  of  1:20,000  this  sampling  density  was  slightly  over  40  dots 
(samples)  per  square  mile.   This  sampling  density  compares  with  44 
points  per  square  mile  used  in  the  CATS  suburban  land  use  survey. 

Airphoto  Interpretation 

The  decision  as  to  the  type  of  land  use  under  a  particular  dot 
was  made  by  the  use  of  the  technique  of  "airphoto  interpretation". 
This  method  was  explained  in  several  references  (4'),  46)  and  used  for 
urban  land  use  collection  by  others  (47,  48,  49).   Airphoto  interpre- 
tation of  a  particular  land  use  was  made  by  use  of  a  stereoscope  and 
a  stereo  pair  of  photos  (photos  which  overlap  in  the  direction  of 
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Notes:   1)   To  show  the  dot  pattern  clearly  the  individual  dots  are 
shown  larger  than  the  ones  used  in  overlay. 

2)   The  figure  shown  is  on  a  reduced  scale.   On  the  actual 

overlay  used  each  eight  units  of  X  or  Y  axis  was  equal  to 
one  inch. 


Figure  17.   Dot  Patterns  Designed  for  Statistical  Sampling 
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flight)  which  allows  three  dimensional  observation  of  the  terrain  and 
the  buildings.   Most  of  the  land  use  under  the  dots  were  interpreted 
with  the  aid  of  a  mirror  stereoscope  with  an  attached  telescope  which 
magnified  the  objects  in  the  photograph  three  times. 

Test  of  Statistical  Point  Sampling  Method 

Prior  to  the  land  use  data  collection  by  point  sampling  technique 
the  density  of  sampling,  as  well  as  the  skill  of  the  investigator  in 
airphoto  interpretation  were  to  be  tested.   This  was  accomplished  by 
comparing  1969  land  uses  for  Richmond,  Indiana  obtained  from  aerial 
photographs  by  statistical  point  sampling  method  with  1969  land  uses 
obtained  by  the  standard  land  use  inventory  method.   The  values  for 
inventoried  land  use  were  obtained  from  the  planning  commission  of 
city  of  Richmond. 

The  city  limits  of  Richmond  was  outlined  directly  on  12  aerial 
photographs  with  red  china  marking  pencil.   (China  pencils  are 
erasable  without  damaging  the  photos) .   Additional  green  lines 
assigned  exclusive  sections  of  the  city  to  specific  photographs  in 
such  a  manner  as  to  not  use  distorted  edges  of  the  photographs.   Dot 
pattern  transparency  was  overlaid  on  each  photograph  and  a  record  made 
of  the  relative  position  of  the  overlay  and  photograph  to  establish  a 
reference  for  X  and  Y  coordinates  of  the  overlay.   Next  the  land  use 
immediately  under  the  dot  was  classified  to  one  of  the  land  use 
categories  defined  in  Table  30.   A  record  was  made  of  the  land  use 
under  the  dot  as  well  as  the  X  and  Y  coordinates  of  the  dot.   The 
reason  for  recording  the  coordinates  of  each  dot  was  to  enable  dupli- 
cation of  the  measurement  process. 
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The  results  of  560  statistical  point  samplings  of  land  uses  in 
the  city  of  Richmond  were  the  number  of  dots  which  had  fallen  on  top 
of  nine  different  land  uses.   Given  the  fact  that  the  density  of  the 
dot  patterns  used  were  four  dots  per  one  square  inch  of  the  aerial 
photograph  and  the  fact  that  the  central  area  of  the  airphotos  have 
an  approximate  scale  of  one  inch  to  1667  feet  then  every  dot  represents 
15. 9A  (1667  X  1667  ^  4  x  A3560),  acres  of  land.   Therefore,  the  number 
of  dots  in  a  given  category  of  land  use  were  multiplied  by  15.94  to 
convert  the  measured  dots  to  its  equivalent  statistical  area  in  acres. 

The  land  use  inventory  for  Richmond  and  statistical  point  sampling 
method  were  compared  in  Table  31.   Land  use  types  as  listed  in  column 
two  of  this  table  were  obtained  from  the  Riclimond  Planning  Commission. 
Industrial  land  use  and  land  use  for  other  transportation  in  Table  30 
were  aggregated  for  incorporation  in  Table  31;  as  were  vacant, 
agriculture,  and  water  and  forest  in  Table  30  were  aggregated  for  use 
in  Table  31  under  classification  of  undeveloped  land.   Column  4  of 
Table  31  shows  the  standard  error  of  each  measurement  calculated  by 
equation  11  as  exemplified  in  the  bottom  of  the  Table  31  where  the 
standard  error  for  residential  land  use  was  calculated.   Column  6 
shows  the  95  percentile  confidence  interval  which  were  obtained 
based  on  the  assumption  of  normality  of  dot  patterns.   Although  the 
standard  error  used  in  construction  of  this  confidence  interval  was 
obtained  from  the  point  sampling  method,  the  inventoried  land  use 
values  were  used  as  best  estimates  of  the  means  of  each  land  use. 
Column  7  of  Table  31  shows  the  percentage  of  each  land  use  as  measured 
from  snniplinp,  of  aerial  photograpiis.   Comparison  of  Column  6  and 
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Column  7  shows  that  for  all  land  use  categories  the  statistically 
'  measured  land  use  was  within  the  95  percentile  confidence  limits  of 
corresponding  inventoried  land  use. 

Repeatability  of  Statistical  Point  Sampling 

To  check  on  repeatability  of  statistical  point  sampling  method, 
six  of  the  twelve  photos  covering  the  city  of  Richmond  were  randomly 
chosen  and  the  whole  statistical  point  sampling  procedure  with  the 
exception  of  drawing  the  city  limit  lines  on  the  photos  was  repeated. 
Results  of  the  second  partial  land  use  determination  ai'e  compared  in 
Table  32  with  the  land  use  measured  on  this  same  six  photos  on  the 
first  statistical  point  sampling  of  Richmond.   Comparison  of  columns 
six  and  seven  of  this  table  shows  that  the  land  use  measured  in  the 
second  run  was  within  the  95  percent  confidence  interval  of  the 
measurement  of  the  first  run.   Therefore  it  was  assumed  that 
statistical  point  sampling  may  be  used  as  an  efficient  means  of  ob- 
taining land  uses  for  calibration  of  the  transportation-city   develop- 
ment path  model. 

Collection  of  Land  Use  Data 

The  land  use  data  were  collected  for  all  but  two  of  the  sample 
cities  by  the  statistical  point  sampling  method  described  for  Richmond. 
The  two  cities  for  which  land  use  data  were  not  collected  are  the 
cities  of  Dunkirk  (1940)  for  which  city  limits  could  not  be  located 
(National  Archives  could  not  locate  its  place  map)  and  Rockville 
(1960)  for  which  aerial  photos  were  not  available.   These  two  cities 
were  not  replaced  and  it  was  assumed  that  their  omission  was  not 
affecting  the  experiment  significantly. 
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Prior  to  the  actual  data  collection  attention  was  paid  to  the 
order  in  which  the  sample  cities  were  to  be  measured.   The  ordering 
of  sample  cities  was  important  to  avoid  the  error  of  "instrumentation". 
Error  of  "instrumentation"  is  the  error  which  causes  bias  in  the  result 
of  experiment  due  to  systematic  data  measurements.   It  is  not  easy  to 
always  identify  the  sources  of  the  error  of  "instrumentation",  there- 
fore it  is  useful  to  randomize  the  order  in  which  the  data  for  sample 
cities  is  measured. 

Airphotos  for  1950,  1960,  and  1970  sample  cities  were  borrowed 
from  the  Indiana  State  Highway  Commission  and  U.S.  Geological  Survey. 
It  was  not  desirable  to  request  to  borrow  all  the  photos  for  a  long 
time.   Therefore,  as  a  compromise,  cities  in  every  study  period  were 
divided  randomly  into  three  batches  and  the  airphotos  for  these 
batches  were  interpreted  continuously,  but  with  internal  re-randomiza- 
tion. 

Tables  33  to  36  show  land  use  measurements  for  the  sample  cities 
in  1940,  1950,  1960  and  1970  respectively.   Column  1  of  these  tables 
shows  the  last  two  digits  of  the  year  of  the  aerial  photographs  (as 
well  as  its  month  in  decimal)  used  in  the  land  use  measurements. 
Columns  3  to  11  shows  the  acreage  of  different  land  uses  measured  from 
aerial  photos  for  sample  cities.   The  classification  shown  in  these 
columns  was  the  same  as  the  one  explained  in  Table  30. 

Census  Data  (X5 ,  X6 ,  X7) 
Total  city  population  (X7) ,  total  employment  (X5)  and  median 
family  income  (X6)  for  sample  cities  were  obtained  from  census 
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publications  (3,  4,  5,  6)  and  were  listed  in  Tables  37  to  AO.   Median 
family  income  for  1940  was  missing.   This  information  was  not  published 
and  a  written  inquiry  to  the  Census  Bureau  showed  that  the  1940  income 
data  was  not  collected  for  cities  under  100,000  population  (50).   Also 
research  in  Indiana  publications  within  the  state  of  Indiana 
failed  to  reveal  this  information. 

Column  3  of  these  tables  shows  the  total  employment  within  the 
city.   The  variable,  employment  per  1,000  persons  (X5) ,  was  used  in 
the  path  model  as  proxy  for  total  employment.   The  reason  for  normali- 
zation of  employment  by  population  was  to  make  employment  data  for 
different  cities  comparable. 

Statistical  Analysis  and  Results 
The  statistical  studies  were  analyzed  in  four  steps:   First  the 
path  model  of  Figure  8  was  calibrated  separately  with  1940,  1950,  1960 
and  1970  data.   Second  the  validity  of  the  calibrated  models  were 
tested  for  their  ability  to  predict  the  dependent  variables.   Third 
the  four  calibrated  models  were  qualitatively  compared  to  see  if  the 
four  models  could  be  combined  in  a  single  overall  model.   In  the  last 
step  the  final  calibrated  models  were  used  to  determine  total  (direct 
and  indirect)  effect  of  independent  variables  on  the  dependent  variables, 

Calibration  of  the  Path  Models 
The  path  model  to  be  calibrated  was  represented  by  the  nine  re- 
cursive regression  equations  6a  to  6i,  discussed  on  pages  30  to  38 
Each  of  these  nine  equations  were  represented  by  the  following  general 
equation: 
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Table  37.   Census  Data  for  19A0  Sample  Cities 


Total 

Family 

CLtv  Name 

Population 

Employment 

Median  Income 

UALPARAISO 

873G 

3202 

N.A. 

KEMDALLUILLE 

5431 

200G 

n.A. 

WARSAW 

6378 

234G 

M.A. 

HUMTIMGTOM 

13903 

4G09 

N.A. 

KOKOMO 

337S5 

11345 

n.A. 

HARTFORD  CITY 

B94B 

2393 

n.A. 

LAFAVETTE 

28798 

104G7 

n.A. 

ATTICA 

37G0 

1257 

n.A. 

FRAMKFORT 

1370G 

4G00 

n.A. 

MUISCIE 

49720 

18133 

n.A. 

RICHMOND 

35147 

1321G 

n.A. 

Gfs'EErCASTLE 

4872 

17G5 

n.A. 

COMHERSUILLE 

12898 

474G 

n.A. 

SHELBYUILLE 

10731 

3843 

n.A. 

GREEIS5BURG 

G0G5 

202G 

n.A. 

BATESUILLE 

30G5 

10S5 

n.A. 

BLOOMIMGTOM 

20070 

GOCO 

n.A. 

LIHTON 

G2G3 

1539 

N.A. 

BEDFORD 

12514 

3243 

N.A. 

UIHCEMNE5 

18228 

GOGG 

N.A. 

JASPER 

5041 

1824 

N.A. 

n.A.-  NOT  AUAILABLE 


Table  38.   Census  Data  for  1950  Sample  Cities 
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Total 

Family 

City  Name 

Population 

Employment 

Median  Income 

ROCHESTER 

4G73 

1888 

2350 

MOMTICELLO 

34G7 

14E8 

2713 

LOGAHSPORT 

21031 

8GB2 

2720 

KOKOMO 

38B72 

15905 

30G0 

HARTFORD 

7253 

2883 

2G74 

PORTLAMD 

70E4 

28G9 

2875 

FRAnKFORT 

15028 

5813 

2909 

TIPTOH 

,  5B33 

2102 

2195 

ALEXANDRIA 

5147 

2023 

20G2 

ANDERSON 

4B820 

21294 

2CC2 

NEW  CASTLE 

10271 

74G4 

3)53 

RICHMOND 

39539 

IBBbS 

2S91 

GREENCASTLE 

G88B 

2359 

G88 

SHELBYUILLE 

11734 

520G 

2G39 

FRANKLIN 

731G 

2704 

2429 

BLOOMINGTON 

281G3 

10017 

1)03 

AURORA 

4780 

1804 

2G97 

BEDFORD 

125G2 

495G 

2759 

BICKNELL 

4572 

1474 

2521 

UINCENNES 

18831 

7215 

2513 

PETERSBURG 

3035 

1025 

2042 

JASPER 

5215 

2080 

2097 

TELL  CITY 

5735 

2343 

2718 

NAPANEE 

3333 

1318 

2C59 

Table  39.   Census  Data  for  1960  Sample  Cities 


110 


Total 

Family 

City  Name 

Population 

Employment 

Median  Income 

MICHIGftM  CITV 

3GB53 

12990 

G1G3 

GOSHEN 

13718 

58E4 

GOIB 

PLYMOUTH 

7558 

2958 

5781 

LOGfthSPORT 

EllOG 

8098 

5572 

BERilE  CITY 

2S44 

1112 

G113 

LAFAYETTE 

12BS0 

4974 

80E8 

TIPTON 

5G04 

2252 

5771 

ELWOOD 

11793 

4270 

5G38 

COUIMGTON 

2759 

993 

5G48 

ANDERSON 

490G1 

201G0 

59S5 

CHESTERFIELD 

2588 

1039 

7441 

LEBANON. ULEN 

9523 

3G48 

5535 

NOBLESUILLE 

7G44 

3018 

G032 

RICHMOND 

44149 

17343 

5583 

SHELBYUILLE 

14317 

5705 

5585 

MARTINSUILLE 

7525 

2009 

5541 

EDINBURG 

3GG4 

1233 

474G 

COLUMBUS 

2077B 

8201 

G094 

BLOOMINGTON 

31357 

12182 

5443 

MADISON 

10448 

3805 

4854 

BICKNELL 

3878 

12G3 

4702 

WASHINGTON 

1084B 

38G0 

4B08 

PETERSBURG 

2939 

95G 

4431 

BOONUILLE 

4801 

1742 

4GGG 

LAURENCEBURG 

5004 

1811 

5G3a 

SCOTTSBUHG 

3810 

1453 

4928 

Table  40.   Census  Data  for  1970  Sample  Cities 
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Total 

Family 

City  Name 

Population 

Employment 

Median  Income 

MICHIGflM  CITY 

393G9 

15413 

11240 

LONG  BEACH 

2740 

983 

20997 

ELKHART  CITY 

43152 

18G03 

12812 

GOSHEN 

17171 

7825 

11181 

KNOX 

3519 

1353 

8415 

FOWLER 

2G43 

lOGl 

lOGlG 

HARTFORD 

8207 

3139 

9809 

PORTLAND 

7115 

3112 

9537 

COUINGTON 

2G41 

1090 

lOllG 

CRAUFORDSUILLE 

13842 

5542 

10d48 

UPLAND 

3203 

124B 

9080 

NEW  CASTLE 

21215 

7SB3 

9894 

RICHMOND 

43399 

17785 

104C9 

CONNERSUILLE 

17G04 

G7G4 

lOOSG 

COLUMDUS 

27141 

11073 

11948 

BLOOMINGTON 

42890 

1GGG9 

10773 

MADISON 

13081 

5035 

10512 

- 

BICKNELL 

3717 

1230 

753S 

WASHINGTON 

11358 

3042 

9122 

JASPER 

8G41 

3495 

HUB 

FORTRRANCH 

2535 

987 

9791 

BOOnUILLE 

573G 

210B 

10332 

BLOONFIELD 

25G5 

1053 

10099 

DELPHI 

2532 

1180 

1016G 
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X.  =  3„  +  T.      3.  X.  +  e 
^    °   J=l  J   J 

The  calibration  process  used  the  values  of  the  observed  data  to 

estimate  the  empirical  coefficients,  b.,  b, ,  ....,  b   (corresponding 

0   1         n 

to  the  theoretical  coefficients  3^,.  6,,  ....,  3  )• 

The  regression  coefficients  were  estimated  by  the  ordinary  least 
square  regression  analysis.   The  computations  were  performed  on  the 
CDC  6500  computer  at  Purdue  University  using  the  "backward  elimination" 
procedure  (52)  of  the  statistical  package  for  the  social  sciences 
computer  program  (SPSS)  (53).   The  program  output  simultaneously 
produces  regression  coefficients  and  path  coefficients.   The  regression 
coefficients  V7hich  upon  verification  were  used  for  prediction  purposes 
were  calculated  by  use  of  the  raw  observed  data  and  the  path  coeffi- 
cients were  calculated  with  the  standardized  data  (mean  =  0  and  variance 
=  1). 

The  "backward  elimination"  procedure  first  develops  a  regression 
equation  with  all  the  hypothesized  variables  in  the  model  then: 

1.  Calculates  partial  F  values  for  each  of  the  model  variables 
assuming  that  the  remainder  of  the  variables  were  already  in 
the  model. 

2.  Compares  the  lowest  of  the  partial  F's  with  a  critical  F 
value  computed  for  a  specified  significance  level  (F  critical 
was  chosen  for  a  =  0.05). 

3.  As  long  as  the  lowest  partial  F  value  was  less  than  the 
critical  F  value,  the  corresponding  variable  was  dropped  from 
the  regression  equation  and  step  2  was  repeated.   Otherwise 
the  regression  equation  was  accepted. 
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Hypothesis  Testing  of  Regression  Coefficients 
Each  of  the  variables  in  the  final  regression  equations  were 
tested  for  significant  contribution  to  the  equation  after  due  consider- 
ation to  the  influence  of  other  variables  already  in  the  equation.   If 
an  independent  variable  significantly  affects  the  regression  equation, 
its  corresponding  regression  coefficient  was  non  zero.   The  null 
hypothesis  (H^)  assumes  that  the  regression  coefficient  was  zero  and 
the  alternative  hypothesis  (H  )  assumes  that  the  regression  coefficient 
was  non-zero.   If  the  null  hypothesis  was  rejected  then  the  alternative 
hypothesis  was  not  rejected. 

Rejection  of  the  null  hypothesis,  in  part,  depends  on  the  level 
of  significance  used  in  the  hypothesis  testing.   In  this  project  a 
significance  level  of  five  percent  was  used  (a  =  type  I  error  =  0.05); 
which  means  there  was  a  five  percent  chance  of  erroneously  rejecting 
the  null  hypothesis  when  it  should  have  been  accepted.   The  specific 
test  procedure  for  each  regression  coefficient  follows: 

1.  H„,  null  hypothesis:  b.  =  0 

H, ,  alternative  hypothesis:    b.  ^  0  on 

2.  Estimated  statistic,  Partial  F:   F(K,n-K-l)  =  z 

(l-R^)/(n-K-l) 

where: 

n  =  number  of  observations 

K  =  number  of  independent  variables 

2 
R  =  coefficient  of  multiple  determination 

3.  Critical  F  Statistic:   F(K,n-K-l,l-a)  where  K  and  n-K-1  are 
appropriate  degrees  of  freedom. 
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4.   Reject  the  null  hypothesis  if: 

estimated  statistic  >  critical  statistic 

Results  of  1940,  1950,  1960  and  1970  Model  Calibrations 
The  results  of  model  calibrations  for  the  1940,  1950,  1960  and 
1970  data  periods  are  shown  in  Tables  41  to  44  and  Figures  18  to  21. 
The  tables  present  the  calibrated  path  model  with  unnormalized 
variables  and  regression  coefficients.   The  equation  in  these  tables 
after  verification  were  used  for  predictive  purposes.   The  figures 
show  the  same  calibrated  model,  but  with  the  normalized  variables  and 
path  coefficients.   These  path  coefficients  were  used  to  obtain  the 
direct  and  indirect  effect  of  an  independent  variable  on  an  appropriate 
dependent  variable. 

The  results  of  the  calibration  process  show  that  generally  the 
dependent  variables  were  adequately  explained  by  the  models,  but  there 
were  differences  between  the  level  of  explanation  of  the  four  models. 
In  the  discussion  of  the  ability  of  the  equations  to  explain  variations 
of  the  dependent  variables  it  was  noted,  that  there  are  two  distinct 
sets  of  variables  in  the  transportation-  city  development  path  model 
(endogenous  and  exogenous).   The  purpose  of  the  hypothesized  general 
model  was  to  explain  variations  of  residential,  institutional,  vacant, 
commercial,  and  industrial  land  use  (endogenous  variables),  while  the 
purpose  of  inclusion  of  city  locations,  inter-city  accessibility,  intra- 
city  accessibility,  employment,  median  family  income,  and  population 
(exogenous  variables)  in  the  model  was  to  assist  in  explanation  of 
variation  of  land  uses. 
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It  was  expected  the  equations  having  the  endogenous  variables  as 

dependent  would  show  a  higher  level  of  explanation  of  dependent 

2 
variables  (have  higher  R  )  than  the  equations  having  the  exogenous 

variables  as  dependent.   The  latter  equations  were  required  for 

calculations  of  total  effects  coefficients,  but  a  high  level  of 

explanation  was  not  expected  from  them. 


1940  Path  Model 
The  original  hypothesized  model  of  Figure  6  has  54  links,  but  be- 
cause of  unavailability  of  median  family  income  for  1940  the  number  of 
links  in  the  1940  hypothesized  model  was  reduced  to  44.   Figure  18 
shows  the  calibrated  1940  model  to  have  retained  12  links  at  a  signifi- 
cance level  of  95  percent.   These  12  links  explain  most  of  the 

variations  of  the  dependent  variables  as  shown  by  relatively  high 

2 
coefficients  of  determinations  (R  )  associated  with  their  respective 

regression  equation  shown  in  Table  41.   The  calibrated  model  does  not 

describe  the  variations  of  each  of  the  dependent  variables  equally. 

The  following  discussion  pertains  to  the  model's  ability  to  explain 

each  of  the  variables  and  reasonableness  of  the  associated  regression 

equations . 

Residential  Land  Use 

About  ninety-three  percent  of  the  variations  in  the  acres  of  land 
allocated  for  residential  land  use  in  1940  sample  cities  was  explained 
by  the  calibrated  models  in  agreement  with  the  hypothesized  assumption. 
Residential  land  use  was  positively  correlated  with  city  location  and 
city  population.   The  residual  of  this  equation  had  no  pattern;  although, 
the  absolute  amount  of  the  error  becomes  larger  for  larger  cities. 
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Institutional  Land  Use 

About  seventy-seven  percent  variations  of  institutional  land  use 
was  explained  by  the  1940  model.   Institutional  land  use  was  positively 
correlated  with  city  population  and  negatively  correlated  with 
commercial  land  use.   Positive  correlation  of  institutional  land  use 
with  city  population  is  in  agreement  with  the  hypothesized  path  model, 
but  its  negative  correlation  with  commercial  land  use  was  in  disagree- 
ment with  the  hypothesized  model  and  not  readily  explained. 

Vacant  Land  Use 

About  seventy-five  percent  of  variation  in  vacant  land  use  in  the 
sample  cities  was  explained  by  the  model.   Vacant  land  use  was 
positively  related  to  acres  of  land  used  for  city  streets  and  negatively 
related  to  commercial  land  use.   The  dependent  variable  (vacant  land 
use)  as  used  in  this  study  was  in  reality  the  land  reserved  for  future 
use.   It  would  seem  reasonable  that  highway  and  street  construction 
would  increase  the  amount  of  land  reserved  for  future  use.   The 
negative  influence  of  commercial  land  use  on  the  dependent  variable 
was  reasonable  because  more  commercial  land  use  signifies  a  higher 
degree  of  land  use  and  less  availability  of  vacant  land  within  the  con- 
fines of  the  study  where  the  size  of  each  city  was  limited  to  the  land 
within  the  city  limits. 
Commercial  Land  Use 

Only  about  forty-nine  percent  of  variation  in  the  commercial  land 
use  was  explained  by  the  two  independent  variables  in  the  equation. 
The  industrial  land  use  was  negatively  related  with  commercial  land 
use.   Tills  nu);nLlvo  rela  tlonslil  p  was  contrary  to  prior  belief  and  not 
readily  explainable.   Furthermore,  the  results  obtained  from  the  rest 
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of  the  data  (see  Table  42)  shows  a  positive  relation  between  the 
Industrial  land  use  and  commercial  land  use.   This  leads  to  the  belief 
that  either  the  1940  sample  data  does  not  represent  the  inference  space 
adequately  or  the  possibility  of  significant  differences  in  industrial- 
commercial  land  use  patterns  before  and  after  World  War  II.   The  second 
variable,  population,  was  positively  related  to  the  commercial  land 
use,  which  was  in  agreement  with  hypothesized  assumptions. 

City  Population 

City  population  was  not  truly  a  dependent  variable,  but  was 
entered  in  the  model  to  help  explain  variations  in  the  land  uses. 
Ninety-one  percent  of  the  variation  in  the  population  was  explained 
by  the  industrial  land  use  and  the  city's  street   system.   Positive 
correlation  of  industrial  land  use  and  city  streets  with  population  is 
in  agreement  with  the  assumptions  of  the  hypothesized  path  model. 

Employment  Per  1000  Persons 

The  model  explains  only  twenty-two  percent  of  variation  in  city 

2 
employment  by  variation  of  inter-city  accessibility.   Such  a  low  R 

by  itself  would  not  necessarily  be  unsatisfactory  because  employment 

was  not  really  a  dependent  variable,  but  it  was  in  the  model  to  help  to 

explain  variations  in  land  uses.   Unfortunately  in  this  model  there 

were  no  significant  relationships  between  employment  and  any  of  the 

land  uses;  as  a  result  the  employment  variable  as  used  in  this  model 

does  not  serve  any  useful  purpose. 
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Industrial  Land  Use 

Fifty-two  percent  of  variations  in  the  industrial  land  use  was 
explained  by  the  variations  in  the  street  system.   There  was  a  linear 
trend  on  the  residuals  of  this  regression  equation.   The  reason  for 
the  trend  was  that  one  or  more  significant  independent  variables  were 
missing  from  this  equation.   Nevertheless  it  was  significant  that 
fifty  percent  of  variations  in  industrial  land  use  was  explained  by 
the  land  used  for  the  city  streets. 

Intra-City  Accessibility 

There  was  no  satisfactory  regression  equation  of  intra-city 
accessibility  with  city  location  and  inter-city  accessibility.   There- 
fore, the  original  hypothesis  statement  that  location  and  inter-city 
accessibility  influence  intra-city  accessibility  was  not  verified. 

Location  and  Inter-City  Accessibility 

These  two  variables  were  highly  correlated  (correlation 
coefficient,  r  =  0.87)  the  explanation  of  this  correlation  was  out- 
side  the  scope  of  the  model. 

1950  Path  Model 
From  the  original  54  causal  links  in  the  generalized  transporta- 
tion-city development  path  model  the  relationship  represented  by  11  of 
these  links  was  confirmed.   Table  42  and  Figure  19  show  these  signifi- 
cant links.   The  1950  model  explains  variations  of  residential,  in- 
stitutional, commercial,  and  industrial  land  use  adequately,  but  the 
variations  of  the  vacant  land  use  was  not  explained  adequately. 
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Residential  Land  Use 

Ninety-five  percent  of  variation  of  residential  land  use  in  the 
1950  sample  cities  was  explained  by  variation  of  city  population  and 
median  family  income.   Correlation  of  higher  income  and  higher  resi- 
dential land  use  was  reasonable  because  generally  the  more  affluent 
population  live  in  residential  units  with  the  larger  lots. 

Institutional  Land  Use 


About  seventy-six  percent  of  variation  in  the  1950  institutional 
land  use  was  explained  by  the  city  population.   As  institutional  land 
use  was  there  to  serve  the  people  this  equation  was  intuitively 
correct  and  in  agreement  with  prior  assumptions. 

Vacant  Land  Use 

Only  forty-four  percent  of  variations  in . the  vacant  land  was 
explained  by  the  1950  model.   Vacant  land  use  varies  negatively  with 
commercial  land  use,  and  positively  with  the  city's  street  system. 

Commercial  Land  Use 

Seventy-three  percent  of  variations  in  the  commercial  land  use 
was  explained  by  the  industrial  land  use.   This  was  in  agreement  with 
the  assumptions  that  commercial  development  follows  the  industrial 
development  to  serve  the  industry  and  the  workers. 

City  Population 

Ninety-six  percent  of  the  variation  in  the  city  population  was 

explained  by  the  1950  model.   City  population  was  negatively  related 

to  median  family  income  and  positively  related  to  the  city's  street 
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system.   The  negative  relation  between  income  and  population  implies 
that,  other  conditions  being  equal,  city's  with  a  higher  median  family 
income  have  a  smaller  number  of  inhabitants. 

Median  Family  Income 

Only  thirty-eight  percent  of  the  variation  in  the  median  family 

income  was  explained  by  the  variation  among  employment  per  1000  persons 

2 
in  the  1950  model.   But  this  low  R  was  not  unsatisfactory,  because  the 

purpose  of  the  model  was  not  to  predict  median  family  income,  but  to 

deploy  this  variable  to  explain  variations  in  land  uses.   The  positive 

correlation  of  median  family  income  with  employment  per  1000  persons 

was  in  agreement  with  the  hypothesized  assumptions. 

Employment  Per  1000  Persons 

Only  thirty-one  percent  of  the  variation  in  the  employment  rate 
was  explained  by  the  variation  in  the  Industrial  land  use  in  the  1950 
model.   The  correlation  of  employment  rate  and  industrial  land  use  was 
intuitively  correct  and  in  agreement  with  prior  assumptions.   Evidently 
the  employment  rate  depends  on  other  factors  not  in  the  equation,  such 
as  the  number  of  workers  who  commute  to  other  cities  for  work.   Never- 
theless this  relationship  was  useful  because  its  purpose  was  to  assist 
in  the  explanation  of  the  variations  of  the  land  use  and  not  to  predict 
employment  rate. 

Industrial  Land  Use 


About  eighty-three  percent  of  the  variation  in  the  industrial  land 
use  was  explained  by  the  variation  in  the  city's  street  system  by  the 
1950  model. 
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Intra-City  Accessibility 

There  were  no  satisfactory  regression  equations  to  explain  the 
variations  of  intra-city  accessibility  with  city  location  and  inter- 
city accessibility. 

Location  and  Intcr-City  Accessibility 

These  two  variables  were  highly  correlated  (correlation  coefficient, 
r   „  =  0.84).   The  explanation  of  this  correlation  was  outside  the  scope 
of  the  hypothesized  path  model. 

1960  Path  Model 
The  original  hypothesized  model  of  Figure  6  has  5A  links.   The 
calibrated  1960  model  as  presented  in  Table  43  and  Figure  20  confirms 
19  of  these  links.   These  19  links  adequately  explain  variations  of 
residential,  institutional,  vacant,  and  industrial  land  use,  but  do 
not  adequately  explain  the  variations  in  the  commercial  land  use. 

Residential  Land  Use 

Ninety-eight  percent  of  the  variation  in  the  residential  land  use 
was  explained  by  the  1960  model.   Residential  land  use  was  negatively 
related  with  employment  per  1000  persons  and  institutional  land  use; 
and  positively  related  with  city  location,  intra-city  accessibility, 
and  population.   The  negative  correlation  of  residential  land  use  with 
employment  and  institutional  land  use  was  contrary  to  original  assump- 
tions, and  their  implication  was  counter- intuitive.   Negative 
correlation  of  employment  per  1000  persons  and  institutional  land  use 
with  residential  land  use  implies  that  the  city's  with  higher  per 
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capita  employment  and  higher  institutional  land  use  allocate  smaller 
acreage  for  residential  land  use. 

Institutional  Land  Use 

About  ninety-six  percent  of  the  variation  in  the  institutional 
land  use  was  explained  in  the  1960  model.   Institutional  land  use  was 
positively  related  with  population  and  vacant  land;  and  negatively 
related  with  commercial  and  industrial  land  use.   The  implication  of 
negative  correlation  was  that  cities  with  more  industrial  and 
commercial  land  use  allocate  less  land  for  institutional  purposes. 

Vacant  Land  Use 

About  eighty-seven  percent  of  the  variation  of  vacant  land  use 
was  explained  by  the  1960  model.   Vacant  land  use  was  positively  re- 
lated with  intra-city  accessibility  and  negatively  with  population. 
This  equation  seems  reasonable  because  a  more  extensive  street  system 
creates  more  vacant  land  (land  reserved  for  future  use)  and  a  larger 
population  creates  pressures  for  deployment  of  the  reserved  land. 

Commercial  Land  Use 

Only  about  forty-two  percent  of  the  variation  in  the  commercial 
land  use  was  explained  by  the  1960  model.   Commercial  land  use  was 
positively  related  with  the  intra-city  accessibility.   This  positive 
correlation  of  commercial  land  use  and  intra-city  accessiblity  seems 
reasonable. 
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City  Population 

About  ninety  percent  of  the  variation  in  the  city  population  was 
explained  by  the  1960  model.   City  population  was  positively  correlated 
with  employment  and  intra-city  accessibility,  and  negatively  correlated 
with  inter-city  accessibility.   The  negative  correlation  of  city  popu- 
lation and  inter-city  accessibility  would  seem  to  be  counter-intuitive. 
Inter-city  accessibility  was  a  function  of  the  region  wide  highway 
system  and  the  population  of  all  other  cities  in  the  region.   The 
negative  correlation  between  the  population  and  inter-city  accessibility 

may  imply  that:  when  the  smaller  city  was  located  near  a  larger  city; 

the  larger  city  was  negatively  influencing  the  growth  of  the  smaller  city. 

Median  Family  Income 

Only  forty-eight  percent  of  the  variation  in  the  median  family 
income  was  explained  by  the  1960  model.   Median  family  income  was 
positively  related  with  the  city  location  and  employment  per  1000 
persons.   The  positive  correlation  of  median  family  income  and  em- 
ployment rate  intuitively  seems  reasonable.   The  positive  correlation 
of  income  and  city  location  implies  that  for  a  given  employment  rate 
the  cities  in  the  northern  part  of  Indiana  have  a  higher  median  family 
income  than  cities  in  southern  Indiana. 

Employment  Per  1000  Persons 

Only  twenty-seven  percent  of  the  variation  in  the  employment  rate 
was  explained  in  the  1960  model.  Employment  was  positively  correlated 
with  inter-city  accessibility  which  seems  reasonable. 
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Industrial  Land  Use 

About  eighty-two  percent  of  the  variation  in  the  industrial  land 
use  was  explained  by  the  1960  model.  Industrial  land  use  was  positively 
related  with  intra-city  accessibility,  which  was  in  agreement  with 
prior  assumptions. 

Intra-City  Accessibility 

Only  twenty-two  percent  of  the  variation  in  the  intra-city 
accessibility  was  explained  by  the  1960  model.   Intra-city  accessibility 
was  positively  related  with  inter-city  accessibility,  which  implies 
that,  other  conditions  being  equal,  cities  with  better  access  to  other 
cities  also  have  a  better  street  system. 

Location  and  Inter-City  Accessibility 

These  two  variables  were  highly  correlated  (correlation  coeffi- 
cients, r^  „  =  0.90).   The  explanation  of  this  correlation  was  outside 
the  scope  of  the  hypothesized  path  model. 

1970  Path  Model 
Calibration  of  the  path  model  of  Figure  7  with  the  1970  data  con- 
firms 18  of  the  54  original  hypothesized  links.   These  18  links 
adequately  explain  the  variations  of  all  1970  land  uses  with  the  ex- 
ception of  commercial  land  use  which  was  not  adequately  explained. 
Table  44  and  Figure  21  contain  the  calibration  results  of  the  1970 
path  model. 
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Residential  Land  Use 

About  ninety-eight  percent  of  the  variations  of  residential  land 
use  was  explained  by  the  1970  model.   Residential  land  use  was 
positively  correlated  with  commercial  land  use,  inter-city  accessi- 
bility, intra-city  accessibility,  and  city  population;  and  negatively 
correlated  with  city  location.   The  positive  correlations  were  in 
agreement  with  the  hypothesized  model  and  intuitively  reasonable.   The 
negative  correlation  with  the  city  location  implies  that,  other  con- 
ditions being  equal,  the  southern  Indiana  cities  allocate  less  land 
for  residential  land  use  than  cities  in  northern  Indiana. 

Institutional  Land  Use 

About  ninety-four  percent  of  the  variation  in  the  institutional 
land  use  was  explained  by  the  1970  model.   Institutional  land  use,  in 
agreement  with  the  hypothesized  model,  was  positively  correlated  with 
employment  per  1000  persons,  vacant  land  use  and  population,  but  con- 
trary to  prior  assumption  was  negatively  correlated  with  industrial 
land  use.   This  negative  correlation  may  imply  that,  other  conditions 
being  equal,  highly  industrialized  cities  use  less  of  their  land  for 
public  service  purposes. 

Vacant  Land  Use 

About  eight-nine  percent  of  the  variation  of  the  vacant  land  use 
was  explained  by  the  1970  calibrated  model.   Vacant  land  use  was 
correlated  positively  with  Intra-city  accessibility,  which  was  in 
agreement  with  the  hypothesized  assumptions. 
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Commercial  Land  Use 

Only  fifty-three  percent  of  variations  in  the  commercial  land  use 
was  explained  by  the  1970  calibrated  model.   But  positive  correlation 
of  commercial  land  use  and  population  was  in  agreement  with  the 
hypothesized  assumptions. 

City  Population 

About  sixty-eight  percent  variations  of  city  population  was  ex- 
plained by  the  calibrated  1970  model.   The  positive  correlation  of 
city  population  with  industrial  land  use  was  in  agreement  with  the 
hypothesized  assumptions. 

Median  Family  Income 

At  95  percent  significance  level  (a  =  0.05)  there  were  no  signi- 
ficant regression  equation  developed  to  explain  the  variations  of 
median  family  income.   Therefore  no  equation  for  median  family  income 
was  shown  in  the  Table  44.   But,  the  following  equation: 

Median  Family  Income  =  10,526.9  +  12.08  City  Location 

which  explains  fifteen  percent  of  variations  in  median  family  income 
was  significant  at  the  94.2  percent  (a  =  0.058).   This  equation  im- 
plies, all  other  conditions  being  equal,  a  higher  median  family  income 
in  the  northern  Indiana  cities  than  southern  Indiana  cities. 

Employment  Per  1000  Persons 

Only  about  twenty-seven  percent  of  the  variation  in  the  employ- 
ment rate  was  explained  by  the  1970  calibrated  model.   Employment  rate 
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was  correlated  negatively  with  intra-city  accessibility  and  positively 
with  industrial  land  use.   The  negative  correlation  of  employment  rate 
with  intra-city  accessibility  was  contrary  to  the  hypothesized  assump- 
tion and  in  disagreement  with  the  results  obtained  from  the  1940,  1950, 
and  1960  calibrated  models.   The  positive  correlation,  of  employment 
rate  and  industrial  land  use  was  in  agreement  with  the  hypothesized 
assumptions. 

Industrial  Land  Use 


Eighty-eight  percent  variation  of  industrial  land  use  was  ex- 
plained by  the  1970  calibrated  model.   Industrial  land  use  was 
positively  correlated  with  both  intra-city  accessibility  and  inter- 
city accessibility.   These  correlations  were  in  agreement  with 
hypothesized  assumptions  and  may  mean  that  better  inter  and  intra-city 
accessibility  positively  influence  industrial  development. 

Intra-City  Accessibility 

Only  twenty-four  percent  variation  of  the  intra-city  accessibility 
was  explained  by  the  1970  model.   The  intra-city  accessibility  was 
positively  correlated  with  inter-city  accessibility,  which  may  mean 
that  cities  with  higher  inter-city  accessibility  also  have  a  higher 
intra-city  accessibility.   The  intra-city  accessibility  was  negatively 
related  with  city  location,  which  implies  northern  Indiana  cities 
deploy  less  land  for  their  street  system  compared  with  cities  in 
southern  Indiana. 
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Location  and  Inter-Clty  Accessibility 

These  two  variables  were  highly  correlated  (correlation  coef- 
ficient, r  „  =  0.94).   The  explanation  of  this  correlation  was  out- 
side  the  scope  of  the  hypothesized  path  model. 

Ability  of  Calibrated  Path  Models  to  Predict  Land  Use  Variables 

General  path  model  of  Figure  6  in  the  previous  section  was  cali- 
brated separately  with  the  collected  data  for  1940,  1950,  1960  and 
1970.  An  analysis  was  made  to  determine  whether  the  calibrated  models 
were  capable  of  predicting  land  uses  within  the  95  percent  confidence 
interval  or  not.  For  this  purpose  it  was  necessary  to  have  available 
data  for  cities  within  the  inference  space  which  were  not  used  in  the 
calibration  of  the  model  under  investigation.  But  in  the  calibration 
process  of  the  last  section  it  was  desirable  to  use  all  the  available 
information  to  obtain  the  best  possible  models,  and  there  were  no  un- 
used data  to  test  the  calibrated  models  of  Tables  41  to  44. 

To  overcome  this  difficulty,  three  cities  were  removed  from  the 
data  set  for  each  of  the  four  decades  and  the  remainder  of  the  data 
were  used  to  recalibrate  the  parameters  of  the  regression  equations. 
The  three  cities  removed  from  each  data  set  were  chosen  randomly;  one 
each  from  the  three  different  city  size  groups.   The  new  calibration 
process  for  each  regression  equation  did  not  begin  with  the  list  of 
independent  variables  hypothesized  by  the  original  model  of  Figure  6, 
but  used  the  significant  independent  variables  determined  by  the 
previous  calibration  process  to  estimate  the  new  parameters  associated 
with  the  reduced  data  set. 
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The  recalibrated  regression  equations  were  then  used  to  estimate 
the  95  percent  confidence  interval  for  the  land  uses  of  the  three 
cities  not  used  in  their  calibration  process.   If  the  measured  land 
uses  fell  in  the  95  percent  confidence  interval  predicted  by  the  re- 
calibrated models  then  the  calibrated  models  were  valid  to  predict 
land  use  for  other  cities  in  the  inference  space  at  the  95  percent 
significant  level. 

Estimation  of  the  standard  error  for  the  predicted  mean  value  of 
a  dependent  variable  depends  on  the  specified  values  of  the  set  of 
independent  variables  and  was  given  by  the  following  equation  (54)  in 
the  matrix  notation: 

^Y(mean)  =  ^^^  ^^0  (^'^)"\) 
where 

MSE     =  mean  square  error 

X„      =  set  of  specified  independent  variables  for  which 

standard  error  is  to  be  estimated 
X       =  set  of  independent  variables  used  to  calibrate  the 

regression  equation 
X',X'    =  transpose  of  the  respective  matrix 
(X'X)    =  inverse  of  the  X'X  matrix. 
The  present  task  requires  prediction  of  each  dependent  variable  based 
on  a  single  observation  for  a  specified  set  of  independent  variables. 
The  predicted  value  of  this  dependent  variable  was  given  by  the  Y,  but 
the  standard  error  was  larger  than  given  by  the  above  formula,  and  was 
given  by 
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S^  =  MSE  (1+X^(X'X)  h^) 

As  a  result,  the  95  percent  confidence  interval  for  the  predicted 
value  of  a  single  observation  was  given  by 

Y  +  t  (0.975;  n-P-lXS^) 

where 

n  =  number  of  observations  used  to  calibrate  the  regression 
equation 

P  =  number  of  independent  variables  in  the  regression  equation. 

Program  CPIYL,  prepared  by  Purdue  University  statistics  department 
(55),  was  used  to  recalibrate  all  the  regression  equations,  with  one 
of  the  land  uses  as  the  dependent  variable  and  It  was  used  to  calcu- 
late the  standard  error,  and  the  95  percent  confidence  interval  for 
the  three  cities  not  used  in  the  recalibration  of  the  models. 

Tables  45  to  48  contain  the  results  from  the  above  analysis  for 
1940,  1950,  1960,  and  1970  data  sets.   In  each  table  the  measured 
value,  the  predicted  value,  the  standard  error,  and  the  95  percent 
confidence  interval  for  each  of  the  five  land  uses  were  given  for  the 
three  cities  not  used  in  the  recalibration  of  the  regression  equations. 
The  results  presented  in  Tables  45  to  48  were  conservative  in  the 
sense  that  the  original  calibrated  equations  as  presented  in  Tables 
41  to  44  (which  were  used  for  any  other  prediction  of  land  uses)  were 
better  equations  (they  have  larger  degrees  of  freedom  and  smaller  mean 
square  error)  than  the  recalibrated  equations  used  to  produce  the 
models  contained  in  Tables  45  to  48. 
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Tables  45  to  48  show  that  generally  the  predicted  95  percent 
confidence  intervals  contained  the  values  of  corresponding  measured 
values  of  land  uses.   Comparison  of  60  measurements  of  land  uses  with 
their  respective  predicted  95  percent  confidence  interval  show  that 
all  but  three  of  the  measurements  fall  within  the  predicted  95  percent 
confidence  intervals.   The  three  measurements  which  were  not  predicted 
at  the  95  percent  confidence  level  with  the  recalibrated  equations 
were  the  1960  vacant  and  commercial  land  use  for  Michigan  City  and 
the  1970  industrial  land  use  for  the  city  of  Elkhart. 

Qualitative  Comparison  of  the  Four  Calibrated  Models 
In  the  previous  section,  results  obtained  from  the  calibrations 
of  1940,  1950,  1960  and  1970  models  was  separately  reported.   The 
five  equations  (obtained  from  the  four  models)  which  explain  each  of 
the  land  uses  were  compared  and  any  difference  between  the  way  each 
of  the  four  models  explain  the  variations  of  the  land  uses  was  reported 
to  study  the  feasibility  of  combining  the  data  for  the  four  models 
into  an  overall  model.   In  evaluating  the  difference  between  two 
equations  which  contain  different  independent  variables  to  explain  the 
variations  of  the  same  dependent  variable,  it  was  important  to  know 
the  contribution  of  each  independent  variable  to  the  explanation  of 
the  variations  of  the  dependent  variable.   Assume  the  two  equations 
each  contain  three  independent  variables  with  only  one  common  indepen- 
dent variable.   At  first  glance,  the  two  equations  seem  quite 
different  but  if  the  common  independent  variable  contributes  toward 
ninety  percent  of  the  variation  of  the  dependent  variable,  the  two 
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different  equations  are  indeed  similar.   The  elasticity  of  each  in- 
dependent variable  measures  the  contribution  of  an  independent  variable 
toward  explanation  of  the  variations  of  the  dependent  variable. 

Vi 

Elasticity,  e  =  —^   .  was  given  as  part  of  the  SPSS  regression  output 
(56),  where: 

B,  =  unnormalized  regression  coefficient  for  the  independent 
variable, 

X.  =  mean  of  the  ith  variable,  Y  =  mean  of  the  dependent  variable. 
Table  49  shows  the  Independent  variables  that  explain  variations  of 
each  of  the  five  land  uses  by  the  four  models  and  the  elasticity 
associated  with  each  of  the  regression  coefficients.   This  table  was 
utilized  to  examine  independent  variables  used  to  explain  variations 
of  each  of  the  land  uses  by  the  four  models. 

Qualitative  Comparison  of  the  Four  Equations 
Predicting  Residential  Land  Use 
All  the  four  models  show  that  residential  land  use  was  positively 
correlated  with  the  city's  population.   The  influence  of  city  popula- 
tion on  residential  land  use  was  substantial  in  the  1940,  1950,  and 
1960  model,  but  was  only  moderate  in  the  1970  model.   City  location 
influenced  residential  land  use  positively  in  the  1940  and  1950  models 
and  negatively  in  the  1970  model,  but  elasticities  of  city  location 
variables  were  respectively,  0.018,  0.002,  and  0.048  which  means  the 
contribution  of  the  city  location  variable  in  explanation  of  variation 
of  residential  land  use  was  practically  negligible.   For  example,  city 
location  explains  two  percent  (0.018/0.868)  of  variation  of  residential 
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land  use  In  the  19A0  model.   The  1950  model  was  the  only  model  which 
contained  median  family  income  to  explain  residential  land  use.   Intra- 
city  accessibility  was  utilized  in  both  the  1960  and  the  1970  models. 

In  the  1950  model,  institutional  land  use  and  employment  per  1000 
persons  negatively  influence  residential  land  use.   The  above  negative 
correlations  were  counter- intuitive.   The  negative  influence  of  in- 
stitutional land  use  with  elasticity  of  0.166  was  negligible  (7.3  per- 
cent), but  the  negative  influence  of  employment  per  1000  persons  was 
substantial  (45.6  percent).   Only  in  the  1970  model  was  inter-city 
accessibility  and  commercial  land  use  influencing  the  residential 
land  use.   The  influence  of  inter-city  accessibility  with  elasticity 
of  1.764  was  substantial  (66.  6  percent) ,  but  influence  of  commercial 
land  use  with  elasticity  of  0.141  was  negligible  (5.3  percent). 

Qualitative  Comparison  of  the  Four  Equations 
Predicting  Institutional  Land  Use 
The  1950  model  contains  population  as  the  only  independent 
variable  to  explain  variations  in  the  institutional  land  use.   The 
population  variable  also  contributes  substantially  to  explanation  of 
the  variation  of  institutional  land  use  in  the  other  three  models.   In 
the  1940  model  and  1960  model,  the  commercial  land  use  was  negatively 
correlated  with  institutional  land  use.   Both  the  1960  and  the  1970 
models  use  the  industrial  land  use  and  vacant  land  use  to  explain  part 
of  the  variation  of  institutional  land  use.   The  negative  correlation 
of  industrial  land  use  with  institutional  land  use  was  contrary  to  the 
hypothesized  assumptions  and  imply  that,  other  conditions  being  equal, 
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the  industrialized  cities  were  allocating  less  land  for  public  service 
purposes.   Employment  per  1000  persons  was  used  only  in  the  1970 
model.   The  elasticity  of  employment  per  1000  persons  was  2.828  which 
means  the  employment  rate  explains  A9  percent  of  the  variation  of  in- 
stitutional land  use  in  the  1970  model.   The  implication  of  the  re- 
sults obtained  from  the  1970  model  was  that  the  so  called  "bedroom" 
cities  or  those  cities  with  higher  employment  rate  but  fewer  industries 
use  more  of  their  land  for  public  services. 

Qualitative  Comparison  of  the  Four  Equations 
Predicting  Vacant  Land  Use 
Both  the  1940  and  1950  models  use  commercial  land  use  and  intra- 
city  accessibility  to  explain  variations  of  vacant  land  use.   In  both 
models  the  influence  of  commercial  land  use  was  negative  and  the  in- 
fluence of  intra-city  accessibility  was  positive.   The  elasticities 
of  the  independent  variable  in  the  two  equations  were  about  the  same. 
The  positive  correlation  of  vacant  land  use  and  intra-city  accessibility 
was  due  to  the  fact  that  a  larger  street  system  creates  more  land  re- 
served for  the  future  use,  and  the  negative  correlation  of  vacant  land 
with  commercial  land  use  implies  decreasing  quantities  of  land  reserved 
for  future  use  at  later  stages  of  development.   Intra-city  accessibility 
was  also  used  to  explain,  partially,  the  variations  of  vacant  land  use 
in  the  1960  model.   The  other  variable  in  the  1960  model  was  population 
which  negatively  influences  the  vacant  land  use.   The  negative  in- 
fluence of  population  on  vacant  land  use  seems  reasonable,  because  to 
satisfy  the  needs  of  population  vacant  land  was  converted  to  the  higher 
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land  uses.   Intra-city  accessibility  alone  explains  all  the  variations 
in  the  1970  vacant  land  use  model. 

Qualitative  Comparison  of  the  Four  Equations 
Predicting  Commercial  Land  Use 
Industrial  land  use  was  positively  correlated  with  commercial 
land  use  in  the  1940  and  1950  models.   Population  was  positively 
correlated  with  commercial  land  use  in  the  1940  and  1970  models. 
Intra-city  accessibility  was  contained  only  in  the  1960  model  and,  by 
itself,  explains  all  the  variations  of  commercial  land  use.   The 
models,  although  different  from  each  other,  were  intuitively  correct 
and  in  agreement  with  the  hypothesized  assumptions. 

Qualitative  Comparison  of  the  Four  Equations 
Predicting  Industrial  Land  Use 
Intra-city  accessibility,  alone,  explains  all  the  variations  of 
industrial  land  use  in  the  1940,  1950  and  1960  models.   The  1970  model, 
in  addition  to  the  intra-city  accessibility,  contains  the  inter-city 
accessibility  to  explain  variations  of  the  industrial  land  use. 

Results  of  Qualitative  Comparisons  of  the  Four  Models 
Comparison  of  the  variables  used  to  explain  variation  of  land 
uses  by  the  four  models  showed  that  there  are  significant  differences 
in  the  way  the  four  models  explain  the  variations  of  land  uses  for  the 
four  decades  and  it  does  not  seem  feasible  to  combine  the  data  for  the 
four  decades  to  calibrate  an  overall  model. 
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Total  Effect  of  an  Independent  Variable  on 
a  Dependent  Variable 
The  individual  path  coefficients  of  Figure  18  to  21  represent 
change  of  the  dependent  variable,  measured  in  standard  deviation,  as 
direct  result  of  one  standard  deviation  change  in  the  independent 
variable.   But  the  independent  variables  also  affect  the  dependent 
variable  indirectly,  through  the  intermediatery  independent  variables. 
Direct  effect  of  an  independent  variable  on  the  dependent  variable  is 
biased  because  often  the  consideration  of  indirect  effects  of  the 
independent  variable  on  the  dependent  variable  may  result  in  a  total 
effect  different  from  the  direct  effect.   In  this  section  the  total 
effect  of  each  of  the  independent  variables  for  each  of  the  dependent 
variables,  for  the  1940,  1950,  1960  and  1970  data  sets,  was  calcuated. 

Procedure  for  Calculation  of  Total  Effects  of 
Independent  Variables  on  the  Dependent  Variables 
for  the  General  Path  Model 
The  principles  of  calculations  of  total  (direct  and  indirect) 
effect  of  an  independent  variable  on  a  dependent  variable  was  dis- 
cussed in  pages  29  to  30  for  the  four  variable  example  path  model  of 
Figure  4.  The   values  for  total  effect  of  an  independent  variable 
on  the  dependent  variable  is  obtained  by  writing  all  possible  paths 
between  the  two  variables,  multiplying  the  path  coefficients  associated 
with  each  link  of  the  path,  and  finally  summing  the  partial  effect 
coefficient  for  eacli  of  the  paths  to  lead  to  the  total  effects  of  the 
independent  variables  on  tlie  dependent  variables. 
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The  equations  used  in  calculations  of  total  effect  coefficients 
for  the  hypothesized  eleven  variable  path  model  are  quite  lengthy. 
The  path  model  variable  Z  affects  variable  Z   through  Z 
different  paths.   For  the  hypothesized  eleven  variable  model,  the 
equation  measuring  total  effects  of  variable  Z   on  variable  Z^  ^  con- 
tains Z         =  512  different  terms  because  variable  Z   effects 
variable  Z^  thru  512  different  paths.   Table  50  shows  number  of 
possible  paths  through  which  any  of  the  independent  variables  could 
influence  any  of  the  dependent  variables.   Comparison  of  generalized 
path  model  of  Figure  8  with  the  calibrated  models  of  Figure  18  to  21 
shows  many  of  the  paths  were  statistically  insignificant.   The  number 
of  terms  in  the  total  effect  equations  are  smaller  than  the  number 
given  in  Table  50.   Nevertheless  it  was  found  easier  to  solve  the 
general  problem  for  an  eleven  variable  model  and  to  substitute  zero 
for  values  of  insignificant  path  coefficients,  than  developing 
separate  solutions  for  the  four  models. 

The  general  solution  was  obtained  by  first  developing  the  set  of 
all  possible  paths  from  each  independent  variable  to  each  dependent 
variable.   The  above  set  was  formed  based  on  the  principle  that  all 
the  possible  paths  were  formed  by  all  possible  permutations  of  11 
numbers  with  the  important  constraint  that  the  numbers  had  to  be 
arranged  in  ascending  order.   The  set  which  is  presented  in  the 
Appendix  B  was  used  in  lieu  of  the  equations  of  total  effect  because 
once  all  the  possible  paths  between  the  independent  variables  and 
dependent  variables  were  known  corresponding  path  coefficient  could 
be  easily  assigned  to  each  link  of  the  paths  to  result  In  the 
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Table  50.   Number  of  Possible  Paths  Thru  Which  an  Independent  Variable 
May  Influence  a  Dependent  Variable  in  an  Eleven  Variable 
Path  Model 


Number  of  Paths 

Thru  Wlilch  The 

Independent  Variable 

Dependent            Independent 

Influences  The 

Variable               Variable 

Dependent  Variable 

(Zj)               (z,) 

Z(J-l-l) 

ZU                    21 

512 

22 

esG 

23 

128 

24 

B4 

25 

32 

2G 

16 

27 

8 

2G 

4 

29 

2 

210 

1 

210                     21 

asG 

22 

128 

23 

G4 

24 

32 

25 

IB 

2G 

B 

27 

4 

28 

2 

29 

1 

23                     21 

123 

22 

E4 

23 

32 

24 

IE 

25 

8 

2G 

4 

27 

2 

28 

1 

23                      21 

S4 

22 

32 

23 

16 

24 

8 

25 

4 

2E 

2 

27 

1 

27                     21 

32 

22 

16 

23 

8 

24 

4 

25 

S 

ZG 

1 

2G                     21 

IG 

22 

8 

23 

4 

24 

2 

25 

1 

25                     21 

8 

22 

4 

23 

2 

24 

1 

24                     21 

4 

22 

2 

23 

1 

23                     21 

2 

22 

1 

22                      21 

1 

TOTfiL  m.  OF  POSSIBLE  PATHS  IM  THE  11 

UARIOBLE  MODEL  -  203G 
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theoretically  correct  equations.   Secondly  by  use  of  data  presented 
in  Figures  18  to  21  four  other  data  sets  for  the  values  of  the  path 
coefficients  for  the  calibrated  19A0,  1950,  1960  and  1970  path  models 
were  prepared.   These  four  data  sets  contained  all  the  path  coeffi- 
cients; with  zero  substituted  for  the  path  coefficient  corresponding 
with  a  statistically  insignificant  link.   Finally  a  simple  computer 
program  was  employed  which  multiplied  the  path  coefficients  associated 
with  each  link  of  each  of  the  paths  and  summed  the  resultant  partial 
effect  coefficients  for  all  the  paths  between  an  independent  and 
dependent  variable. 

Results  Obtained  from  Calculation  of  Total  Effects  of 
Independent  Variables  on  Dependent  Variables 
Total  effect  coefficients  of  each  independent  variable  on  the 
dependent  variables  are  given  in  Column  3  of  Tables  51  to  54.   The 
interpretation  of  numerical  values  of  total  effects,  C.   ,  is  that, 
other  conditions  being  equal,  for  each  standard  deviation  change  in 
the  independent  variable  the  value  of  the  dependent  variable  is 
changed  by  C  .  .  standard  deviation  of  the  dependent  variable.   Column 
4  of  Tables  51  to  54  show  the  standard  deviation  of  measured  dependent 
variables.   Column  6  which  is  obtained  by  multiplication  of  values  of 
Column  3  by  Column  4  shows  the  value  of  change  in  each  dependent 
variable  for  one  standard  deviation  change  in  the  independent  variable. 
Standard  deviations  of  the  independent  variables  are  presented  in 
Column  5  of  these  same  tables  to  show  the  required  amount  of  change  in 
the  independent  variable  to  provide  the  change  shown  in  Column  6  of 
the  tables. 
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Results  of  analysis  of  total  effects  of  independent  variables  on 
independent  variable  showed  that  more  of  the  hypothesized  model's 
variables  effected  the  dependent  variables  than  was  evident  from 
results  of  calibrations  of  the  hypothesized  model  shown  in  Tables  41 
to  44.   The  reason  for  this  phenomenon  is  that  the  predictive  equations 
of  Tables  40  to  42  only  contain  those  independent  variables  which 
directly  affect  the  dependent  variable,  but  the  total  effect  analysis 
also  considered  those  variables  which  indirectly  affected  the 
dependent  variables. 

Results  obtained  from  total  effect  calculations  are  used  as  a 
tool  in  the  sensitivity  type  analysis  in  the  transportation  or  city 
planning  process.   Suppose  an  increase  in  a  city's  highway  system  is 
contemplated;  then  the  results  of  total  effect  analysis  can  be  used 
to  check  on  the  magnitude  of  other  land  uses  effected  by  the  change 
in  the  highway  system.   It  should  be  noted  that  all  the  independent 
variables  are  not  in  the  control  of  the  planner.   For  example,  the 
location  of  each  city  is  fixed  and  the  values  of  intcr-city 
accessibility  are  only  affected  by  the  statewide  planning  decisions, 
but  other  independent  variables  can  be  affected  by  the  city  wide 
planning  decisions.   Qualitative  comparison  of  the  four  calibrated 
models  showed  that  they  are  different  from  each  other.   For  current 
or  future  sensitivity  analysis  the  most  useful  information  is  con- 
tained in  the  results  of  total  effect  analysis  obtained  from  the  1970 
model.   Therefore  the  results  obtained  from  1970  total  effect 
calculations  is  analyzed  in  the  following  section. 
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Total  Effect  of  Independent:  Variables  on  Residential  Land  Use  -  1970 

Residential  land  use  is  negatively  affected  by  city  location  and 
positively  affected  by  inter-city  accessibility,  intra-city  accessi- 
bility, industrial  land  use,  city  population  and  commercial  land  use. 
City  location  as  used  in  the  model  is  a  dummy  variable  measured  with 
reference  to  a  horizontal  line  passing  through  Indianapolis.   There- 
fore the  implication  of  the  negative  direct  and  indirect  effect  of 
the  city  location  on  residential  land  use  is  that  other  conditions 
being  equal  1970  sample   cities  in  southern  Indiana  used  less  land 
for  residential  purposes  than  cities  in  northern  Indiana. 

The  implication  of  total  positive  effect  of  iiiLer-city  accessi- 
bility on  residential  land  use  is  that,  cities  wiLli  better  access  to 
other  population  centers  have  greater  residential  land  use  than  cities 
with  lesser  access  with  other  population  center.   It  should  be  noted 
that  the  above  statement  does  not  mean  the  sample  cities  with  greater 
inter-city  accessibility  use  a  greater  portion  of  their  land  for 
residential  purpose,  but  means  that  among  1970  sample  cities  for  every 
increase  of  one  standard  deviation  inter-city  accessibility  the 
residential  land  use  was  increased  by  1.65  standard  deviation  of 
residential  land  use.   Most  cities  have  flexibility  in  enlarging  the 
city  limits,  and  another  way  of  looking  at  the  results  was  that  the 
overall  influence  of  increase  in  inter-city  accessibility  was  increase 
in  the  size  of  city.   Conversely  the  overall  influence  of  city 
location  in  the  southern  Indiana  was  for  the  cities  to  remain  small. 
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Among  the  variables  in  the  control  of  city  planner,  positive 
effect  of  intra-city  accessibility  on  residential  land  use  was  sub- 
stantial, while  positive  effect  of  industrial  land  use,  city  popula- 
tion, and  commercial  land  use  were  moderate. 

Total  Effect  of  Independent  Variables  on  Institutional  Land  Use  -  1970 

Institutional  land  use  is  negatively  affected  by  city  location 
and  positively  affected  by  inter-city  accessibility,  intra-city 
accessibility,  industrial  land  use,  employment  per  1000  persons,  city 
population  and  vacant  land  use.   Among  the  variables  somewhat  in  the 
control  of  the  planning  process  intra-city  accessibility,  and 
industrial  land  use  had  substantial  influence  in  the  institutional 
land  use.   Influence  of  employment  rate  on  institutional  land  use  is 
small. 

Total  Effect  of  Independent  Variables  on  Vacant  Land  Use  -  1970 

City  location  affects  vacant  land  use  negatively;  while  inter- 
city accessibility  and  intra-city  accessibility  affect  vacant  land 
positively.   Vacant  land  usually  is  the  land  reserved  for  future  use 
therefore  the  implication  of  results  obtained  is  that  cities  located 
in  southern  Indiana  had  the  tendency  to  remain  smaller  than  cities  in 
northern  Indiana;  and  higher  accessibilities  had  expansionary  effects 
on  the  sample  cities. 

Total  Effect  of  Independent  Variables  on  Commercial  Land  Use  -  1970 

City  location  affects  commercial  land  use  negatively;  while  inter- 
city accessibility,  intra-city  accessibility,  industrial  land  use  and 
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city  population  affect  commercial  land  use  positively.   The  implica- 
tion of  negative  affect  of  city  location  on  commercial  land  use  was 
that  the  sample  cities  in  southern  Indiana  had  less  land  used  for 
commercial  land  use. 

Total  Effect  of  Independent  Variables  on  Industrial  Land  Use 

City  location  affects  industrial  land  use  negatively.   This 
implies  that  cities  in  southern  Indiana  had  smaller  industrial  land 
use  than  northern  cities.   While  inter-city  accessibility  and  intra- 
city  accessibility  had  substantial  positive  effect  on  industrial 
land  use. 

References 

1.  Campbell,  D.  T.  ,  and  Stanley,  J.  C,  Experimental  and  Quasi 
Experimental  Design  for  Research,  Rad  McNally  &  Co.,  Chicago, 
1966. 

2.  State  Planning  Services  Agency,  "A  Directory  of  Indiana  Local  and 
Regional  Planning  and  Development  Commissions",  Indianapolis,  June 
1978,  p.  5. 

3.  U.S.  Bureau  of  Census,  U.S.  Census  of  Population:  1970,  Vol.  1, 
Characteristics  of  the  Population,  Part  16,  Indiana,  U.S. CO. P., 
Washington,  D.  C,  April  1973. 

4.  U.S.  Bureau  of  Census,  U.S.  Census  of  Population:   1960,  Vol.  1, 
Characteristics  of  the  Population,  Part  16,  Indiana,  U.S.  Govern- 
ment Printing  Office,  Washington,  D.  C,  1963. 

5.  U.S.  Bureau  of  Census,  U.S.  Census  of  Population:   1950,  Vol.  2, 
Characteristics  of  the  Population,  Part  14,  U.S.  Government 
Printing  Office,  Washington,  D.  C,  n.d. 

6.  U.S.  Bureau  of  Census,  U.S.  Census  of  Population:  1940,  Vol.  II, 
Characteristics  of  the  Population,  Part  2,  Florida-Iowa,  U.S. 
Government  Printing  Office,  Washington,  D.  C,  1943. 

7.  Ibid,  pp.  842-844. 


161 


8.  Land,  Kenneth,  "Principal  of  Path  Analysis",  pp.  3-37  in  Edgar 
Borgotta  (ed.).   Sociological  Methodology,   1969,  San  Francisco, 
Jossey  Bass,  p.  9. 

9.  Shafran,  Isaac  and  Wegman,  Frederic  J.,  "The  Influence  of  the 
Highway  Network  Structure  on  the  Economic  Development  of  West 
Virginia",  Transportation  Research  Board.  Record  285,  1969- 

10.  Hejal,  S.  S. ,  Traffic  Speed  Report  No.  88,  Joint  Highway  Research 
Project,  Purdue  University,  Lafayette,  Oct.  1970. 

11.  Tharp,  K.  J.,  and  Petty,  D.  F.,  Traffic  Speed  Report  No.  74,  Joint 
Highway  Research  Project,  Purdue  University,  Lafayette,  Nov.  1961. 

12.  Lang,  E.  R.  and  Michael,  H.  L. ,  Traffic  Speed  Report  No.  44,  Joint 
Highway  Research  Project,  Purdue  University,  Lafayette,  May  1950. 

13.  Frost,  R.  E.,  "Indiana  Wartime  Traffic  Speeds",  in  Proceedings  of 
the  Twenty-Ninth  Annual  Road  School,  Lafayette,  Jan.  1943, 

pp.  99-106. 

14.  Vodrazka,  Walter  Charles, "Subclassification  of  the  State  Highway 
System  of  Indiana  Based  on  Synthesis  of  Intercity  Travel," Ph.D. 
Thesis,  Purdue  University,  W.  Lafayette,  June  1968. 

15.  Indiana  State  Highway  Commission,  1940  Indiana  State  Highway  System 
(Scale  1  inch  =  10  miles),  Indianapolis,  1940. 

16.  Indiana  State  Highway  Commission,  1950  State  Highway  System  of 
Indiana  (Scale  1  inch  =  10  miles),  Indianapolis,  1950. 

17.  Indiana  State  Highway  Department,  1960  State  Highway  System  of 
Indiana  (Scale  1  inch  =  10  miles),  Indianapolis,  1960. 

18.  Indiana  State  Highway  Commission,  1970-1971  Indiana  State  Highway 
(Scale  1  inch  =  10  miles),  Indianapolis,  1971. 

19.  Federal  Writers'  Project  Works  Progress  Administration,  Illinois 
1939  (Highway  Map),  (Scale  1  inch  =  12  miles),  n.p.,  n.d. 

20.  "Highway  Map  of  Illinois",  Standard  Highway  Mileage  Guide  (Scale 
1  inch  =  16  miles).  Rand  McNally  &  Company,  1949,  pp.  230-231. 

21.  State  of  Illinois,  1961  Illinois  Official  Highway  Map,  (Scale 
1  inch  =  12.5  miles),  Springfield,  1961. 

22.  Illinois  Division  of  Highways,  1970  Official  Highway  Map  (Scale 
1  inch  =  12  miles),  Springfield,  1970. 


162 


23.  Kentucky  Department  of  Highways,  Official  (1941)  Kentucky  Road 
Map,  (Scale  1  inch  =  12  miles),  Dec.  1941,  n.p. 

24.  Kentucky  Department  of  Highways,  Official  Kentucky  Road  Map  (Scale 
1  inch  =  12  miles),  Frankfort,  April  1950. 

25.  Kentucky  Department  of  Highways,  1960  Official  Kentucky  Depart- 
ment of  Highways  Road  Map,  Frankfort,  n.d. 

26.  Kentucky  Department  of  Highways,  Kentucky  Department  of  Highways 
Map  -  August  1970  (Scale  1  inch  =  12  miles),  Frankfort,  1970. 

27.  Michigan  State  Highway  Department,  Official  Michigan  Highway  Map 
(Scale  1  inch  =  14.5  miles),  Winter  Edition  1939-1940,  1939, 
Lansing. 

28.  Michigan  State  Highway  Department,  Michigan  Official  Highway  Map, 
1950  (Scale  1  inch  =  14.5  miles),  Lansing,  April  1950. 

29.  Michigan  State  Highway  Department,  Michigan  1960  Official  Highway 
Map  (Scale  1  inch  =  14.5  miles),  Lansing. 

30.  Michigan  State  Highway  Commission,  Michigan  1970  Official  Highway 
Map  (Scale  1  inch  =  14.5  miles),  Lansing,  n.d. 

31.  U.S.  Geological  Survey, "The  Federal  Aid  Highway  System  Program 
Map,"l  inch  =  7.9  miles,  map  1  (northern  Ohio)  and  Map  2  (southern 
Ohio),  n.p. ,  June  1,1939. 

32.  Ohio  Department  of  Highways,  Ohio  Official  Highway  Map,  1959-1960, 
(Scale  1  inch  =  10  miles),  n.d.,  n.p. 

33.  Ohio  Department  of  Highways,  Ohio  Official  Highway  Maps,  1950-1960, 
(Scale  1  inch  =  10  miles),  n.p.,  1960. 

34.  Ohio  Department  of  Highways,  1969  Official  Ohio  Highway  Map  and 
Economic  Digest,  (Scale  1  inch  =  10  miles),  Columbus,  n.d. 

35.  Vodrazka,  Walter  Charles,  Ibid.,  p.  92. 

36.  Homburger,  Wolfgang  S., "Traffic  Estimation  Computer  Programs  for 
Educational  Purposes  -  3rd  Edition','  Institute  of  Transportation 
and  Traffic  Engineering,  University  of  Californai,  Berkeley, 
Oct.  1970,  pp.  31-35. 

37.  Ibid,  pp.  40-45. 

38.  "ASCS  Aerial  Photography  Status  Maps  as  of  December  31,  1974", 
Agricultural  Stabilization  and  Conservation  Service,  U.S.  Depart- 
ment of  Agriculture,  Salt  Lake  City,  Utah. 


163 


39.  Miles,  Robert  D.,  "Availability  of  Aerial  Ptiotographs" ,  in 
Standards  of  Practice  for  Indiana  Society  oC  Professional  Land 
Surveyors ,  sec.  VIII,  Item  6,  pp.  13-20,  n.d. 

40.  Davis,  Jeane  M.  ,  "Uses  of  Airphotos  for  Rural  and  Urban  Planning", 
Agricultural  Handbook  No.  315,  U.S.  Department  of  Agriculture, 
June  1966,  p.  9. 

41.  Hadfield,  Sam  M.  and  Orzeske,  John  D.,  "A  Sampling  Technique  for 
Updating  a  Quantitative  Land  Use  Inventory",  Transportation 
Research  Board,  Record  126,  1966,  p.  81. 

42.  Falkner,  E.,  "Land  Use  Changes  in  the  Parkway  School  District", 
Photogrammetric  Engineering,  XXXIV,  Jan.,  1968,  pp.  52-57. 

43.  U.S.  Department  of  Agriculture,  "Sampling,  Coding  and  Storing  Flood 
Plain  Data",  Agriculture  Handbook  No.  237,  August  1962. 

44.  Ibid. 

45.  Branch,  Melville  C. ,  City  Planning  and  Aerial  Information,  Harvard 
University  Press,  Cambridge,  1971. 

46.  Manual  of  Photographic  Interpretation,  American  Society  of  Photo- 

grammetry,  1960. 

47.  Mgbejume,  Joel  Keboku,  "Application  of  the  Techniques  of  Aerial 
Photographic  Interpretation  in  the  Analysis  of  Land  Use  Changes 
in  GreenTree Borough,  Pennsylvania",  Masters  Thesis,  University  of 
Pittsburgh,  Pittsburgh,  1962. 

48.  Pownall,  L.  L. ,  "Airphoto  Interpretation  of  Urban  Use  in  Madison, 
Wisconsin",  Photogrammetric  Engineering,  1950,  pp.  414-426. 

49.  Collins,  W.  Gordon,  et  al,  "The  Acquisition  of  Urban  Land  Use 
Information  from  Aerial  Photographs  of  the  City  of  Leads  (Great 
Britain)',  Photographic  Engineering.  Vol.  27,  1971,  pp.  71-92. 

50.  Letter  from  Meyer  Zitter,  Chief  Population  Division,  Bureau  of 
Census,  to  Hadi  Yamin,  Purdue  University,  dated  Sept.  27,  1977. 

51.  Hutchinson,  B.  G. ,  Principles  of  Urban  Transport  Systems  Planning, 
Scripta  Book  Company,  Washington,  D.  C. ,  1974,  p.  58. 

52.  Draper,  N.  R.  and  Smith,  H. ,  Applied  Regression  Analysis,  John 
Wiley  &  Sons,  Inc.,  New  York,  1966,  pp.  167-169. 

53.  Nie,  Norman,  et  al.  Statistical  Package  for  the  Social  Sciences, 
McGraw-Hill  Book  Company,  New  York,  1975. 


164 


54.  Draper,  N.  R.  and  Smith,  H.,pp.  21-24  and  pp.  121-122. 

55.  "Confidence  and  Prediction  Intervals  for  Y-Prograra  Name:   CPIYL", 
maintained  by  Purdue  University  Statistics  Department , Nov. ,  1974. 

56.  "SPSS  Regression  Reference,  CS  SPSS  15",  Purdue  University 
Computer  Center,  Nov.  1976,  p.  16. 


165 


CHAPTER  FIVE 
CONCLUSION 

A  path  model  was  hypothesized  to  explain  the  relationship  of 
transportation  development  and  city  development  for  non-SMSA  Indiana 
cities  whose  population  was  less  than  50,000. 

1.  Data  from  1940,  1950,  1960  and  1970  were  used  to  calibrate 
the  path  model  for  the  four  decades.   The  results  of  the 
calibrated  models  which  were  presented  in  Tables  41  to  44  as 
well  as  Figures  18  to  21  are  summarized  in  Table  55.   This 
table  shows  the  percent  of  the  variations  of  land  uses  ex- 
plained by  the  twenty  equations  (five  land  uses  in  four  time 
periods).   Six  of  these  equations  explained  more  than  90  per- 
cent of  land  use  variation  measured  by  them.   Nine  of  tlie 
equations  explained  73  to  88  percent  of  the  measured  variations; 
while  the  remaining  five  equations  measured  44  to  54  percent  of 
measured  land  use  variations. 

2.  Three  cities,  one  from  each  city  size  classification,  were 
removed  from  each  of  the  four  data  sets  and  the  four  path 
models  were  recalibrated  with  the  use  of  the  remainder  of 
data.   It  was  found  that  recalibrated  models  could  estimate 
the  land  uses  for  the  cities  not  used  in  their  calibration 
with  a  ninety-five  percent  significant  level. 
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Table  55.   Summary  of  Results  Obtained  from  Calibration  of  1940,  1950, 
1960  and  19  70  Path  Models 


Percent  of  Variations  Explained  by  the  Path  Model (R^) 
19A0        1950        1960        1970 
Independent  Variables       Model       Model       Model       Model 

Residential  Land  Use 
Institutional  Land  Use 
Vacant  Land  Use 
Commercial  Land  Use 
Industrial  Land  Use 


Note:   a  =  (1-signif icance  level)  for  all  equations  is  0.00  except  as 
noted  by  (*)  where  a  =  0.03. 


93. 4A 

95.40 

98.26 

97.70 

77.47 

75.89 

96.50 

94.13 

74.87* 

44.22 

86.93 

82.29 

49.10 

72.97 

41.79 

54.45 

52.31 

82.87 

82.26 

88.52 
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3.  The  four  reduced  path  models  obtained  from  calibration  of  the 
liypothesi zed  transportation-city  development  path  model  were 
qualitatively  compared  and  it  was  determined  the  four  models 
are  generally  different,  and  it  was  not  feasible  to  combine 
the  data  for  the  four  decades  to  calibrate  an  overall  model. 

4.  Total  effects  coefficients  were  calculated  for  the  four  models. 
These  coefficients  gave  the  total  change,  in  standard  devia- 
tions, of  dependent  variables  for  one  standard  deviation 
change  in  each  independent  variable. 

The  results  obtained  from  this  research  are  useful  as  transporta- 
tion and  city  planning  tools  for  those  non-SMSA  cities  with  population 
greater  than  2,500  and  less  than  50,000. 

1.  Although  the  calibrated  path  models  do  not  include  time  as  a 
variable  it  is  hoped  that  the  calibrated  1970  model  can  be 
used  to  estimate  land  use  changes  as  consequences  of  future 
planning  decisions.   To  test  the  feasibility  of  this  procedure; 
the  five  land  use  variables  for  the  2A  sample  cities  measured 
in  1970  were  estimated  with  the  use  of  1960  model.   The  results 
of  this  test  which  is  presented  in  Appendix  C-1  to  C-5  showed 
that  from  a  total  of  120  observations;  106  were  within  their 
corresponding  95  percent  confidence  interval  as  estimated  by 
the  1960  model. 

2.  Total  effect  coefficients  obtained  from  the  1970  calibrated 
path  model  are  a  powerful  planning  tool  because  the  total 
effect  of  a  proposed  change  in  a  dependent  variable  (while 
controlling  for  all  other  variables)  is  given  by  the  total 
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effect  coefficient.   The  simplicity  of  use  of  total  effect 
coefficients  may  make  them  very  useful  in  environmental  impact 
studies  of  the  proposed  transportation  or  industrial  projects. 
3.   This  research  demonstrated  applicability  of  cross  sectional 
analysis  in  conjunction  with  the  statistical  path  model  to 
explain  the  relationship  of  transportation  development  to 
urban  development.   It  is  believed  the  approach  in  the  long 
run  is  more  fruitful  than  the  traditional  before  and  after 
studies. 
It  is  recommended  that  similar  research  in  other  geographical 
areas  be  carried  out  to  enlarge  the  inference  space  of  the  experiment. 
It  would  be  helpful  for  the  better  understanding  of  the  relationship  of 
transportation  and  city  development  to  compare  the  results  of  similar 
model  development  for  other  states  with  the  results  obtained  in  this 
research. 

Two  areas  of  improvement  are  anticipated  for  future  research  in 
the  relationship  of  transportation  and  city  development: 

1.   In  the  current  project  data  was  collected  for  the  four 

different  periods  of  1940,  1950,  1960,  and  1970.   The  trans- 
portation-city development  path  model  was  calibrated  for  each 
of  the  four  periods  and  it  was  found  that  in  all  the  four 
periods  transportation  development  had  a  significant  effect 
in  the  city  development,  but  qualitatively  it  was  decided 
that  the  four  reduced  models  were  different  from  each  other. 
The  four  calibrated  models  could  not  be  quantitatively  com- 
pared because  tlicre  were  only  about  24  sample  cities  in  each 
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period.   One  possible  way  to  quantitatively  compare  the  four 
models  was  to  accept  time  and  time's  interaction  with  the  11 
hypothesized  variables  as  new  variables  and  test  for  signifi- 
canse  of  time  and  time  interactions.   But  then  there  would  be 
23  variables  in  the  model  and  there  were  not  sufficient  data 
points  to  test  the  statistical  significance  of  the  variables. 
It  is  recommended  that  enough  sample  points  be  chosen  to  test 
for  time  and  its  interaction  or  if  the  project  resources  do 
not  allow  sufficient  sample  points  for  more  than  one  time 
period  to  concentrate  on  the  latest  period  for  which  data  can 
be  collected.   For  example,  in  this  research  the  most  useful 
part  of  the  research  was  the  effort  used  on  analysis  of  the 
1970  data. 

It  is  hoped  that  future  research  Improves  on  the  operational 
definition  of  intra-city  accessibility  as  used  in  this  project 
(acres  of  land  used  for  highways).   The  usual  definition  of 
intra-city  accessibility, 

n 

ace .  =  Z  a^  f .  ^ 
^   J=l  '     ^' 

as  discussed  in  page  94  is  not  conveniently  calculated  for 

this  type  of  project;  where  the  city  as  a  whole  is  the  sample 

point  and  information  about  inter-zonal  travel  is  not  readily 

available. 
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The  Appendices  are  not  included  in  this  copy  of  the  Report.   The 
three  (3)  Appendices  are  as  follows: 


Appendix  A  -  Program  Used  to  Calculate  Interaction 

Potential  and  Interaction  Potential  Tables 

for  1940,  1950,  1960  and  1970  pp.    176-208 

Appendix  B  -  Set  of  All  Possible  Paths  from  Each 

Independent  Variable  to  Each  Dependent 

Variable  pp.    209-238 

Appendix  C  -  Tables  of  Estimated  Land  Uses  from  the 

1960  Model  pp.    239-243 


A  copy  of  each  of  the  above  Appendices  may  be  obtained  at  the  cost  of 

duplication  by  inquiry  to: 

Joint  Highway  Research  Project 
Civil  Engineering  Building 
Purdue  University 
West  Lafayette,  IN  47907 
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